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Abstract 

At the Large Hadron Collider, the identification of jets originating from b quarks is 
important for searches for new physics and for measurements of standard model 
processes. A variety of algorithms has been developed by CMS to select b-quark 
jets based on variables such as the impact parameters of charged-particle tracks, the 
properties of reconstructed decay vertices, and the presence or absence of a lepton, or 
combinations thereof. The performance of these algorithms has been measured using 
data from proton-proton collisions at the LHC and compared with expectations based 
on simulation. The data used in this study were recorded in 2011 at y/s = 7TeV for a 
total integrated luminosity of 5.0 fb _1 . The efficiency for tagging b-quark jets has been 
measured in events from multijet and t-quark pair production. CMS has achieved a 
b-jet tagging efficiency of 85% for a light-parton misidentification probability of 10%. 
For analyses requiring higher purity, a misidentification probability of only 1.5% has 
been achieved, for a 70% b-jet tagging efficiency. 
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1 Introduction 

Jets that arise from bottom-quark hadronization (b jets) are present in many physics processes, 
such as the decay of top quarks, the Higgs boson, and various new particles predicted by super- 
symmetric models. The ability to accurately identify b jets is crucial in reducing the otherwise 
overwhelming background to these channels from processes involving jets from gluons (g) and 
light-flavour quarks (u, d, s), and from c-quark fragmentation. 

The properties of the bottom and, to a lesser extent, the charm hadrons can be used to identify 
the hadronic jets into which the b and c quarks fragment. These hadrons have relatively large 
masses, long lifetimes and daughter particles with hard momentum spectra. Their semileptonic 
decays can be exploited as well. The Compact Muon Solenoid (CMS) detector, with its precise 
charged-particle tracking and robust lepton identification systems, is well matched to the task 
of b-jet identification (b-jet tagging). The first physics results using b-jet tagging have been 
published HHSI from the first data samples collected at the Large Hadron Collider (LHC). 

This paper describes the b-jet tagging algorithms used by the CMS experiment and measure- 
ments of their performance. A description of the apparatus is given in Section |2j The event 
samples and simulation are discussed in Section|3] The algorithms for b-jet tagging are defined 
in Section [4] The distributions of the relevant observables are compared between simulation 
and proton-proton collision data collected in 2011 at a centre-of-mass energy of 7TeV. The 
robustness of the algorithms with respect to running conditions, such as the alignment of the 
detector elements and the presence of additional collisions in the same bunch crossing (pileup), 
is also discussed. 

Physics analyses using b-jet identification require the values of the efficiency and misidentifica- 
tion probability of the chosen algorithm, and, in general, these are a function of the transverse 
momentum (pj) and pseudorapidity (rj) of a jet. They can also depend on parameters such as 
the efficiency of the track reconstruction, the resolution of the reconstructed track parameters, 
or the track density in a jet. While the CMS simulation reproduces the performance of the de- 
tector to a high degree of precision, it is difficult to model all the parameters relevant for b-jet 
tagging. Therefore it is essential to measure the performance of the algorithms directly from 
data. These measurements are performed with jet samples that are enriched in b jets, either 
chosen by applying a discriminating variable on jets in multijet events or by selecting jets from 
top-quark decays. The methods that are used to measure the performance are described in Sec- 
tions |5]and|6] The measurements are complementary: multijet events cover a wider range in pj, 
while the results obtained from tt events are best suited for some studies of top-quark physics. 
The efficiency measurements are summarized and compared in Section|7| The measurement of 
the misidentification probability of light-parton (u, d, s, g) jets as b jets in the data is presented 
in Section [8] 

2 The CMS detector 

The central feature of the CMS apparatus is a superconducting solenoid, of 6 m internal diam- 
eter, which provides a magnetic field of 3.8 T. Within the field volume are the silicon tracker, 
the crystal electromagnetic calorimeter, and the brass/ scintillator hadron calorimeter. Muons 
are measured in gas-ionization detectors embedded in the steel return yoke. 

CMS uses a right-handed coordinate system, with the origin at the nominal interaction point, 
the x axis pointing to the centre of the LHC ring and the z axis along the counterclockwise-beam 
direction. The polar angle, 9, is measured from the positive z axis and the azimuthal angle, (p, 
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is measured in the x-y plane. The pseudorapidity is defined as f] = — ln[tan(0/2)]. A more 
detailed description of the CMS detector can be found elsewhere [4J. 

The most relevant detector elements for the identification of b jets and the measurement of algo- 
rithm performance are the tracking system and the muon detectors. The inner tracker consists 
of 1440 silicon pixel and 15 148 silicon strip detector modules. It measures charged particles up 
to a pseudorapidity of \t]\ < 2.5. The pixel modules are arranged in three cylindrical layers in 
the central part of CMS and two endcap disks on each side of the interaction point. The sili- 
con strip detector comprises two cylindrical barrel detectors with a total of 10 layers and two 
endcap systems with a total of 12 layers at each end of CMS. The tracking system provides an 
impact parameter (IP) resolution of about 15 pm at a pj of lOOGeV/c. In comparison typical 
IP values for tracks from b-hadron decays are at the level of a few 100 pm. Muons are mea- 
sured and identified in detection layers that use three technologies: drift tubes, cathode-strip 
chambers, and resistive-plate chambers. The muon system covers the pseudorapidity range 
\rj\ < 2.4. The combination of the muon and tracking systems yields muon candidates of high 
purity with a pj resolution between 1 and 5%, for pj values up to 1 TeV/c. 

3 Data samples and simulation 

Samples of inclusive multijet events for the measurement of efficiencies and misidentification 
probabilities were collected using jet triggers with pj thresholds of 30 to 300GeV/c. For effi- 
ciency measurements, dedicated triggers were used to enrich the data sample with jets from 
semimuonic b-hadron decays. These triggers required the presence of at least two jets with 
pj thresholds ranging from 20 to HOGeV/c. One of these jets was required to include a muon 
with pj > 5GeV/c within a cone of AR = 0.4 around the jet axis, where AR is defined as 
a/ (A^) 2 + (A77) 2 . Triggers with low-px thresholds were prescaled in order to limit the over- 
all trigger rates. Depending on the prescale applied to the trigger, the multijet analyses used 
datasets with integrated luminosities of up to 5.0 fb _1 . 

Data for the analysis of tt events were collected with single- and double-lepton (e and p) trig- 
gers. The samples were collected in the first part of the 2011 data taking with an integrated 
luminosity of 2.3 fb _1 . The precision on the b-jet tagging efficiency from tf events is limited by 
systematic uncertainties. Using the full dataset collected in 2011 would not reduce the overall 
uncertainty. 

Monte Carlo (MC) simulated samples of multijet events were generated with pythia 6.424 (51 
using the Z2 tune [6 |. For b-jet tagging efficiency studies, dedicated multijet samples have been 
produced with the explicit requirement of a muon in the final state. 

In the simulation, a reconstructed jet is matched with a generated parton if the direction of 
the parton is within a cone of radius AR = 0.3 around the jet axis. The jet is then assigned 
the flavour of the parton. Should more than one parton be matched to a given jet, the flavour 
assigned is that of the heaviest parton. The b flavour is given priority over the c flavour, which 
in turn is given priority over light partons. According to this definition jets originating from 
gluon splitting to bb, which constitute an irreducible background for all tagging algorithms, 
are classified as b jets. 

Events involving tt production were simulated using the MADGRAPH event generator 
(v. 5.1.1.0), where the top quark pairs were generated with up to four additional partons in 
the final state. A top quark mass of m t = 172.5 GeV/c 2 was assumed. The parton configu- 
rations generated by MadGraph were processed with Pythia to provide showering of the 
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generated particles. The soft radiation was matched with the contributions from the matrix 
element computation using the fcj-MLM prescription [8j. The tau-lepton decays were handled 
with TAUOLA (v. 27.121.5) 0. 

The electroweak production of single top quarks is considered as a background process for 
analyses using tf events, and was simulated using POWHEG 301 [10J . The production of W/Z 
+ jets events, where the vector boson decays leptonically has a signature similar to tt and 
constitutes the main background. These events were simulated using MadGraph +PYTHIA, 
with up to four additional partons in the final state. The bottom and charm components are 
separated from the light-parton components in the analysis by matching reconstructed jets to 
partons in the simulation. 

Signal and background processes used in the analysis of tt events were normalized to next-to- 
leading-order (NLO) and next-to-next-to-leading-order (NNLO) cross sections, with the excep- 
tion of the QCD background. 

The top-quark pair production NLO cross section was calculated to be a t i = 157+24 pb, using 
MCFM [11 j. The uncertainty in this cross section includes the scale uncertainties, estimated by 
varying simultaneously the factorization and renormalization scales by factors of 0.5 or 2 with 
respect to the nominal scale of (2m t ) 2 + Pj art ° n ) 2 , where p^. arton are the transverse momenta 
of the partons in the event. The uncertainties from the parton distribution functions (PDF) and 
the value of the strong coupling constant 0.$ were estimated following the procedures from the 
MSTW2008 [la, CTEQ6.6 HU, and NNPDF2.0 sets. The uncertainties were then combined 
according to the PDF4LHC prescriptions |IT5| . 

The i-channel single top NLO cross section was calculated to be cr t = 64.6+3 2 pb using MCFM ffTTl 
IT61H8} . The uncertainty was evaluated in the same way as for top-quark pair production. The 
single top-quark associated production (tW) cross section was set to o~w = 15.7 ± 1.2 pb ||19|. 
The s-channel single top-quark next-to-next-to-leading-log (NNLL) cross section was deter- 
mined to be a s = 4.6 ± 0.1 pb 11201 . 

The NNLO cross section of the inclusive production of W bosons multiplied by its branching 
fraction to leptons was determined to be 0\y-y&/ = 31.3 ± 1.6 nb using FEWZ J2l|, setting the 
renormalization and factorization scales to (%) 2 + (EPj') 2 with % = 80.398 GeV/c 2 . The 
uncertainty was determined in the same way as in top-quark pair production. The normaliza- 
tions of the W+b jets and W+c jets components were determined in a measurement of the top 
pair production cross section in the lepton+jet channel j|22"| , where a simultaneous fit of the tt 
cross section and the normalization of the main backgrounds was performed. 

The Drell-Yan production cross section at NNLO was calculated using FEWZ as <7 Z / 7 *^^(w% > 
20 GeV) = 5.00 ± 0.27 nb, where mn is the invariant mass of the two leptons and the scales were 
set using the Z boson mass m z = 91.1876 GeV/c 2 p3| . 

All generated events were passed through the full simulation of the CMS detector based on 
Geant4 [24J. The samples were generated with a different pileup distribution than that ob- 
served in the data. The simulated events were therefore reweighted to match the observed 
pileup distribution. 

4 Algorithms for b-jet identification 

A variety of reconstructed objects - tracks, vertices and identified leptons - can be used to 
build observables that discriminate between b and light-parton jets. Several simple and ro- 
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bust algorithms use just a single observable, while others combine several of these objects to 
achieve a higher discrimination power. Each of these algorithms yields a single discriminator 
value for each jet. The minimum thresholds on these discriminators define loose ("L"), medium 
("M"), and tight ("T") operating points with a misidentification probability for light-parton jets 
of close to 10%, 1%, and 0.1%, respectively, at an average jet px of about 80GeV/c. Through- 
out this paper, the tagging criteria will be labelled with the letter characterizing the operating 



point appended to the acronym of one of the algorithms described in Sections 4.2 and 4.3 The 
application of such a tagging criterion will be called a "tagger". 

After a short description of the reconstructed objects used as inputs, details on the tagging al- 
gorithms are given in the following subsections, proceeding in order of increasing complexity. 
Muon-based b-jet identification is mainly used as a reference method for performance mea- 
surements. It is described in more detail in SectionH] 

4.1 Reconstructed objects used in b-jet identification 

Jets are clustered from objects reconstructed by the particle-flow algorithm Il25ll26l . This al- 
gorithm combines information from all subdetectors to create a consistent set of reconstructed 
particles for each event. The particles are then clustered into jets using the anti-fcj clustering 
algorithm ||27| with a distance parameter of 0.5. The raw jet energies are corrected to obtain a 
uniform response in 77 and an absolute calibration in pj [28] . Although particle-flow jets are 
used as the default, the b-jet tagging algorithms can be applied to jets clustered from other 
reconstructed objects. 

Each algorithm described in the next section uses the measured kinematic properties of charged 
particles, including identified leptons, in a jet. The trajectories of these particles are recon- 
structed in the CMS tracking system in an iterative procedure using a standard Kalman filter- 
based method. Details on the pattern recognition, the track-parameter estimation, and the 
tracking performance in proton-proton collisions can be found in Refs. 



A "global" muon reconstruction, using information from multiple detector systems, is achieved 
by first reconstructing a muon track in the muon chambers. This is then matched to a track 
measured in the silicon tracker ||30|. A refit is then performed using the measurements on both 
tracks. 

Primary vertex candidates are selected by clustering reconstructed tracks based on the z co- 
ordinate of their closest approach to the beam line. An adaptive vertex fit [31 j is then used 
to estimate the vertex position using a sample of tracks compatible with originating from the 
interaction region. Among the primary vertices found in this way, the one with the highest 
yj(ptf ack ) 2 is selected as a candidate for the origin of the hard interaction, where the pj ack are 
the transverse momenta of the tracks associated to the vertex. 

The b-jet tagging algorithms require a sample of well-reconstructed tracks of high purity. Spe- 
cific requirements are imposed in addition to the selection applied in the tracking step. The 
fraction of misreconstructed or poorly reconstructed tracks is reduced by requiring a trans- 
verse momentum of at least 1 GeV/ c. At least eight hits must be associated with the track. To 
ensure a good fit, xVn.d.o.f. < 5 is required, where n.d.o.f. stands for the number of degrees 
of freedom in the fit. At least two hits are required in the pixel system since track measure- 
ments in the innermost layers provide most of the discriminating power. A loose selection on 
the track impact parameters is used to further increase the fraction of well-reconstructed tracks 
and to reduce the contamination by decay products of long-lived particles, e.g. neutral kaons. 
The impact parameters d xy and d z are defined as the transverse and longitudinal distance to 
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the primary vertex at the point of closest approach in the transverse plane. Their absolute val- 
ues must be smaller than 0.2 cm and 17 cm, respectively. Tracks are associated to jets in a cone 
AR < 0.5 around the jet axis, where the jet axis is defined by the primary vertex and the di- 
rection of the jet momentum. The distance of a track to the jet axis is defined as the distance 
of closest approach of the track to the axis. In order to reject tracks from pileup this quantity 
is required to be less than 700 }im. The point of closest approach must be within 5 cm of the 
primary vertex. This sample of associated tracks is the basis for all algorithms that use impact 
parameters for discrimination. 

Properties of the tracks and the average multiplicity after the selection (except for the vari- 
able plotted) are shown in Fig. [T] The uncertainties shown in this and all following figures 
are statistical unless otherwise stated. The data were recorded with a prescaled jet trigger in 
the second part of 2011 when the number of pileup events was highest. The jet pj threshold 
was 60GeV/c. The distributions show satisfactory agreement with the expectations from sim- 
ulation. The track multiplicity and the lower part of the momentum spectrum are particularly 
sensitive to the modelling of the particle multiplicity and kinematics by the Monte Carlo gener- 
ator, as are other distributions such as the number of hits in the innermost pixel layers. Detector 
effects that are not modelled by the simulation, such as the dynamic readout inefficiency in the 
pixel system, can also contribute to the remaining discrepancies. In Fig. [I] and the following 
figures, simulated events with gluon splitting to bb are shown as a special category. The b jets 
in these events tend to be close in space and can be inadvertently merged by the clustering 
algorithm, resulting in a higher average track multiplicity per jet. 

The combinatorial complexity of the reconstruction of the decay points (secondary vertices) of 
b or c hadrons is more challenging in the presence of multiple proton-proton interactions. In 
order to minimize this complexity a different track selection is applied. Tracks must be within 
a cone of AR = 0.3 around the jet axis with a maximal distance to this axis of 0.2 cm and pass 
a "high-purity" criterion [32|. The "high-purity" criterion uses the normalized % 2 of the track 
fit, the track length, and impact parameter information to optimize the purity for each of the 
iterations in track reconstruction. The vertex finding procedure begins with tracks defined by 
this selection and proceeds iteratively. A vertex candidate is identified by applying an adaptive 
vertex fit |31], which is robust in the presence of outliers. The fit estimates the vertex position 
and assigns a weight between and 1 to each track based on its compatibility with the vertex. 
All tracks with weights > 0.5 are then removed from the sample. The fit procedure is repeated 
until no new vertex candidate can be found. In the first iteration the interaction region is used as 
a constraint in order to identify the prompt tracks in the jet. The subsequent iterations produce 
decay vertex candidates. 

4.2 Identification using track impact parameters 

The impact parameter of a track with respect to the primary vertex can be used to distinguish 
the decay products of a b hadron from prompt tracks. The IP is calculated in three dimensions 
by taking advantage of the excellent resolution of the pixel detector along the z axis. The im- 
pact parameter has the same sign as the scalar product of the vector pointing from the primary 
vertex to the point of closest approach with the jet direction. Tracks originating from the decay 
of particles travelling along the jet axis will tend to have positive IP values. In contrast, the 
impact parameters of prompt tracks can have positive or negative IP values. The resolution 
of the impact parameter depends strongly on the pj and rj of a track. The impact parameter 
significance Sjp, defined as the ratio of the IP to its estimated uncertainty, is used as an observ- 
able. The distributions of IP values and their significance are shown in Fig. [2] In general, good 
agreement with simulation is observed with the exception of a small difference in the width of 
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Figure 1: Track properties after basic selection (except for the variable plotted): (a) number of 
hits in the pixel system, (b) transverse momentum, (c) distance to the jet axis. The average 
number of tracks passing the basic selection is shown in (d) as a function of the transverse mo- 
mentum of the jet. In (a)-(c) the distributions from simulation have been normalized to match 
the counts in data. The filled circles correspond to data. The stacked, coloured histograms 
indicate the contributions of different components from simulated multijet ("QCD") samples. 
Simulated events involving gluon splitting to b quarks ("b from gluon splitting") are indicated 
separately from the other b production processes ("b quark"). In each histogram, the rightmost 
bin includes all events from the overflow. The sample corresponds to a trigger selection with 
jet p T > 60GeV/c. 



the core of the IP significance distribution. 

By itself the impact parameter significance has discriminating power between the decay prod- 
ucts of b and non-b jets. The Track Counting (TC) algorithm sorts tracks in a jet by decreasing 
values of the IP significance. Although the ranking tends to bias the values for the first track to 
high positive IP significances, the probability to have several tracks with high positive values is 
low for light-parton jets. Therefore the two different versions of the algorithm use the IP signif- 
icance of the second and third ranked track as the discriminator value. These two versions of 
the algorithm are called Track Counting High Efficiency (TCHE) and Track Counting High Purity 
(TCHP), respectively. The distribution of the TCHE discriminator is shown in Fig.|3](a). 

A natural extension of the TC algorithms is the combination of the IP information of several 
tracks in a jet. Two discriminators are computed from additional algorithms. The Jet Probabil- 
ity (JP) algorithm uses an estimate of the likelihood that all tracks associated to the jet come 
from the primary vertex. The Jet B Probability (JBP) algorithm gives more weight to the tracks 
with the highest IP significance, up to a maximum of four such tracks, which matches the av- 
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Figure 2: Distributions of (a) the 3D impact parameter and (b) the significance of the 3D impact 
parameter for all selected tracks. Selection and symbols are the same as in Fig. [I] Underflow 
and overflow are added to the first and last bins, respectively. 

erage number of reconstructed charged particles from b-hadron decays. The estimate for the 
likelihood, Pj e t, is defined as 



V = n • E - — with 

1=0 l - !=1 



n = P^max(P,-, 0.005) , 



(1) 



where N is the number of tracks under consideration and P, is the estimated probability for 
track i to come from the primary vertex Il33"ll3"4l . The P; are based on the probability density 
functions for the IP significance of prompt tracks. These functions are extracted from data for 
different track quality classes, using the shape of the negative part of the Sn> distribution. The 
cut-off parameter for P, at 0.5% limits the effect of single, poorly reconstructed tracks on the 
global estimate. The discriminators for the jet probability algorithms have been constructed to 
be proportional to — In Pj et . The distribution of the JP discriminator in data and simulation is 
shown in Fig. |3](b). 
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Figure 3: Discriminator values for (a) the TCHE and (b) the JP algorithms. Selection and sym- 
bols are the same as in Fig. [T] The small discontinuities in the JP distributions are due to the 
single track probabilities which are required to be greater than 0.5%. 



4.3 Identification using secondary vertices 

The presence of a secondary vertex and the kinematic variables associated with this vertex 
can be used to discriminate between b and non-b jets. Two of these variables are the flight 
distance and direction, using the vector between primary and secondary vertices. The other 



8 



4 Algorithms for b-jet identification 



CMS2011,\/s = 7TeV CMS2011,\/s = 7TeV 




3D SV flight distance significance SV mass [GeV/c 2 ] 



Figure 4: Properties of reconstructed decay vertices: (a) the significance of the 3D secondary 
vertex (3D SV) flight distance and (b) the mass associated with the secondary vertex. Selection 
and symbols are the same as in Fig. [TJ 

variables are related to various properties of the system of associated secondary tracks such 
as the multiplicity the mass (assuming the pion mass for all secondary tracks), or the energy. 
Secondary-vertex candidates must meet the following requirements to enhance the b purity: 

• secondary vertices must share less than 65% of their associated tracks with the pri- 
mary vertex and the significance of the radial distance between the two vertices has 
to exceed 3a; 

• secondary vertex candidates with a radial distance of more than 2.5 cm with respect 
to the primary vertex, with masses compatible with the mass of K° or exceeding 
6.5GeV/c 2 are rejected, reducing the contamination by vertices corresponding to 
the interactions of particles with the detector material and by decays of long-lived 
mesons; 

• the flight direction of each candidate has to be within a cone of AR < 0.5 around the 
jet direction. 

The Simple Secondary Vertex (SSV) algorithms use the significance of the flight distance (the 
ratio of the flight distance to its estimated uncertainty) as the discriminating variable. The 
algorithms' efficiencies are limited by the secondary vertex reconstruction efficiency to about 
65%. Similar to the Track Counting algorithms, there exist two versions optimized for different 
purity: the High Efficiency (SSVHE) version uses vertices with at least two associated tracks, 
while for the High Purity (SSVHP) version at least three tracks are required. In Fig. [4] the flight 
distance significance and the mass associated with the secondary vertex are shown. 

A more complex approach involves the use of secondary vertices, together with track-based 
lifetime information. By using these additional variables, the Combined Secondary Vertex (CSV) 
algorithm provides discrimination also in cases when no secondary vertices are found, increas- 
ing the maximum efficiency with respect to the SSV algorithms. In many cases, tracks with 
an Sip > 2 can be combined in a "pseudo vertex", allowing for the computation of a subset 
of secondary-vertex-based quantities even without an actual vertex fit. When even this is not 
possible, a "no vertex" category reverts to track-based variables that are combined in a way 
similar to that of the JP algorithm. 

The following set of variables with high discriminating power and low correlations is used (in 
the "no vertex" category only the last two variables are available): 

• the vertex category (real, "pseudo," or "no vertex"); 
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CMS2011,\/s = 7TeV CMS2011, is = 7 TeV 
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Figure 5: Distributions of (a) the secondary vertex multiplicity and (b) the CSV discriminator. 
Selection and symbols are the same as in Fig.JT] 

• the flight distance significance in the transverse plane ("2D"); 

• the vertex mass; 

• the number of tracks at the vertex; 

• the ratio of the energy carried by tracks at the vertex with respect to all tracks in the 
jet; 

• the pseudorapidities of the tracks at the vertex with respect to the jet axis; 

• the 2D IP significance of the first track that raises the invariant mass above the charm 
threshold of 1.5GeV/c 2 (tracks are ordered by decreasing IP significance and the 
mass of the system is recalculated after adding each track); 

• the number of tracks in the jet; 

• the 3D IP significances for each track in the jet. 

Two likelihood ratios are built from these variables. They are used to discriminate between b 
and c jets and between b and light-parton jets. They are combined with prior weights of 0.25 
and 0.75, respectively. The distributions of the vertex multiplicity and of the CSV discriminator 
are presented in Fig. [5] 

4.4 Performance of the algorithms in simulation 

The performance of the algorithms described above is summarized in Fig. [6] where the predic- 
tions of the simulation for the misidentification probabilities (the efficiencies to tag non-b jets) 
are shown as a function of the b-jet efficiencies. For loose selections with 10% misidentification 
probability for light-parton jets a b-jet tagging efficiency of ~ 80-85% is achieved. In this re- 
gion the JBP has the highest b-jet tagging efficiency. For tight selections with misidentification 
probabilities of 0.1%, the typical b-jet tagging efficiency values are ~ 45-55%. For medium 
and tight selections the CSV algorithm shows the best performance. As can be seen in Fig. |6j 
the TC and SSV algorithms cannot be tuned to provide good performance for the whole range 
of operating points. Therefore two versions of these algorithms are provided, with the "high 
efficiency" version to be used for loose to medium operating points and the "high purity" ver- 
sion for tighter selections. Because of the non-negligible lifetime of c hadrons the separation 
of c from b jets is naturally more challenging. Due to the explicit tuning of the CSV algorithm 
for light-parton- and c-jet rejection it provides the best c-jet rejection values in the high-purity 
region. 
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Figure 6: Performance curves obtained from simulation for the algorithms described in the text, 
(a) light-parton- and (b) c-jet misidentification probabilities as a function of the b-jet efficiency. 



4.5 Impact of running conditions on b-jet identification 

All tagging algorithms rely on a high track identification efficiency and a reliable estimation 
of the track parameters and their uncertainties. These are both potentially sensitive to changes 
in the running conditions of the experiment. The robustness of the algorithms with respect to 
the misalignment of the tracking system and an increase in the density of tracks due to pile up, 
which are the most important of the changes in conditions, has been studied. 

The alignment of the CMS tracker is performed using a mixture of tracks from cosmic rays and 
minimum bias collisions [35. 36 1, and is regularly monitored. During the 2011 data taking, the 
most significant movements were between the two halves of the pixel barrel detector, where 
discrete changes in the relative z position of up to 30 f/m were observed. The sensitivity of b-jet 
identification to misalignment was studied on simulated tt samples. With the current estimated 
accuracy of the positions of the active elements, no significant deterioration is observed with 
respect to a perfectly aligned detector. The effect of displacements between the two parts of 
the pixel barrel detector was studied by introducing artificial separations of 40, 80, 120, and 
160 jinx in the detector simulation. The movements observed in 2011 were not found to cause 
any significant degradation of the performance. 

Because of the luminosity profile of the 2011 data, the number of proton collisions taking place 
simultaneously in one bunch crossing was of the order of 5 to 20 depending on the time period. 
Although these additional collisions increase the total number of tracks in the event, the track 
selection is able to reject tracks from nearby primary vertices. The multiplicity distribution of 
selected tracks is almost independent of the number of primary vertices, as shown in Fig. [7] (a). 
There is an indication of a slightly lower tracking efficiency in events with high pileup. The 
rejection of the additional tracks is mainly due to the requirement on the distance of the tracks 
with respect to the jet axis. This selection criterion is very efficient for the rejection of tracks 
from pileup. The reconstruction of track parameters is hardly affected. The distribution of the 
second-highest IP significance is stable, as shown in Fig.[7|(b). The impact of high pileup on the 
b-jet tagging performance is illustrated in Fig. [HJ This shows the light-parton misidentification 
probability versus the b-jet tagging efficiency for the TCHP and SSVHP algorithms. In order 
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Figure 7: (a) the number of tracks associated with the selected jets for three ranges of primary 
vertex (PV) multiplicity, (b) the IP significance of the second-most significant track, for the three 
ranges of primary vertex multiplicity. The distributions are normalized to the event count for 
1-6 PV range. Underflow and overflow entries are added to the first and last bins, respectively. 
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Figure 8: Light-parton misidentification probability versus b-jet tagging efficiency for the (a) 
TCHP and (b) SSVHP algorithms for different pileup (PU) scenarios. 

to focus on the changes due to the b-jet tagging algorithms, the performance curves have been 
compared using a jet pj threshold of 60 GeV/c at the generator level. The changes are small and 
concentrated in the regions of very high purity. 



5 Efficiency measurement with multijet events 

For the b-jet tagging algorithms to be used in physics analyses, it is crucial to know the effi- 
ciency for each algorithm to select genuine b jets. There are a number of techniques that can be 
applied to CMS data to measure the efficiencies in situ, and thus reduce the reliance on simula- 
tions. If event distributions from MC simulation match those observed in data reasonably well, 
then the simulation can be used for a wide range of topologies after applying corrections deter- 
mined from specific data samples. Corrections can be applied to simulated events using a scale 
factor Sfb, defined as the ratio of the efficiency measured with collision data to the efficiency 
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found in the equivalent simulated samples, using MC generator-level information to identify 
the jet flavour. Furthermore, the measurement techniques used for data are also applied to the 
simulation in order to validate the different algorithms. 

Some efficiency measurements are performed using samples that include a jet with a muon 
within AR = 0.4 from the jet axis (a "muon jet"). Because the semileptonic branching fraction 
of b hadrons is significantly larger than that for other hadrons (about 11%, or 20% when b — > 
c — > I cascade decays are included), these jets are more likely to arise from b quarks than 
from another flavour. Muons are identified very efficiently in the CMS detector, making it 
straightforward to collect samples of jets with at least one muon. These muons can be used to 
measure the performance of the lifetime-based tagging algorithms, since the efficiencies of the 
muon- and lifetime-based b-jet identification techniques are largely uncorrelated. Sections |5.1 



and 5.2 describe efficiency measurements that use muon jets, while the technique of Section 



5.3 



makes use of a more generic dijet sample. The results are given in Section[7[ 

5.1 Efficiency measurement with kinematic properties of muon jets 

Due to the large b-quark mass, the momentum component of the muon transverse to the jet 
axis, Pj, is larger for muons from b-hadron decays than for muons in light-parton jets or from 
charm hadrons. This component is used as the discriminant for the "PtRel" method. In addi- 
tion, the impact parameter of the muon track, calculated in three dimensions, is also larger for 
b hadrons than for other hadrons. This parameter is used as the discriminant for the "IP3D" 
method. Both of these variables can thus be used as a discriminant in the b-jet tagging efficiency 
determination. In both cases, the discriminating power of the variable depends on the muon jet 
pj. The muon pJp 1 (IP) distributions provide better separation for jets with pj smaller (greater) 
than about 120 GeV/c. The PtRel and IP3D methods rely on fits to the pf l fl3ZD and muon IP dis- 
tributions in the data with respect to simulated spectra for the b signal and charm+light-parton 
background. 

In the two methods, the p!j? 1 and IP spectra for muon jets are modelled using simulated distri- 
butions that represent the spectra expected for different jet flavours to obtain the b-jet content 
of the sample. The efficiency for a particular tagger is obtained by measuring the fraction of 
muon jets that satisfy the requirements of the tagger. To make the treatment of the statistical 
uncertainty more straightforward, the muon jet sample is separated into those jets that satisfy 
and those that fail the requirements of the tagger. These jets are referred to as "tagged" and 
"untagged." 

A dijet sample with high b-jet purity is obtained by requiring that events have exactly two re- 
constructed jets: the muon jet as defined above and another jet fulfilling the TCHPM b-jet tag- 
ging criterion (the "medium" operating point for the TCHP algorithm). Simulated MC events 
are used to establish p^ el and IP spectra for muon jets resulting from the fragmentation of b, 
c, and light partons. Muons in light-parton jets mostly arise from the decay of charged pions 
or kaons and from misidentified muons or hadronic punch-through in the calorimeters, effects 
that might not be modelled well in the simulation. The spectra for light-parton jets from sim- 
ulation can be validated against control samples of collision data. In Fig.|9]the distributions of 
Pj 1 and In (| IP | [cm]) derived from the simulation are compared to the ones obtained for tracks 
in inclusive jet data by applying the same kinematic selection and track reconstruction quality 
requirements as for the muon candidates. In order to measure the ability of the simulation to 
model the investigated spectra, we apply the same procedure to a sample of simulated inclu- 
sive jet events. The spectra derived for low-px muons from light-parton jets in simulation are 
corrected by multiplying them with the ratio of shapes of the inclusive distributions obtained 
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in data and simulation on a bin-by-bin basis. 



CMS, 5.0 tb" 1 at \[s = 7 TeV CMS, 5.0 fb" 1 at \ls = 7 TeV 




p'*' [GeV/c] ln(|IP|[cm]) 



Figure 9: Comparison of distributions of (a) muon pj for jets with pj between 80 and 120 GeV/c 
and (b) ln(|IP|[cm]) for jets with pj between 160 and 320GeV/c for muons in simulated light- 
parton jets ("MC light"), tracks from simulated inclusive jet events ("any tracks"), tracks from 
data, and muons in simulated light-parton jets after corrections based on data ("MC light 
scaled"). 



The fractions of each jet flavour in the dijet sample are extracted with a binned maximum like- 
lihood fit using pip 1 and IP templates for b, c and light-parton jets derived from simulation or 
inclusive jet data. The fits are performed independently in the tagged and untagged subsam- 



ples of the muon jets. Results of representative fits are shown in Figs. 10 and 11 
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Figure 10: Fits of the summed b and non-b templates, for simulated muon jets, to the muon 
Pj 1 distributions from data, (a) and (c) show the results for muon jets that pass (tagged) or 
fail (untagged) the b-jet tagging criteria of the JPL method, respectively, (b) and (d) are the 
equivalent plots for the CSVM method. The muon jet pj is between 80 and 120 GeV/c. 
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CMS, 5.0 fb" 1 at sS = 7 TeV 
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Figure 11: Same as Fig. 10 using the In ( | IP | [cm]) distributions. The muon jet pj is between 160 
and320GeV/c. 



From each fit the fractions of b jets (f h g , f^" tae ) are extracted from the data. With these fractions 
and the total yields of tagged and untagged muon jets (NjJL, N^^" 8 ), the number of b jets in 
these samples are calculated, and the efficiency e^ 8 for tagging b jets in the data is inferred: 



tag 



rtag . N tag 
Jb JV data 



rtag tag fU 
Jb iV data ^ Jb 



untag - N™ 1 ^ 



(2) 



data 



To obtain SF^, the efficiency for tagging b jets in the simulation is obtained from jets that have 
been identified as b jets with MC generator-level matching. 

5.2 Efficiency measurement with the System8 method 



The "System8" method |38j (39J is applied to events with a muon jet and at least one other, 
"away-tag", jet. The muon jet is used as a probe. The reference lifetime tagger and a supple- 
mentary pj el -based selection are tested on this jet. The away-tag jet is tested with a separate 
lifetime tagger. There are eight quantities that can be counted from the full data sample. The 
quantities depend on the number of passing or failing tags. A set of equations correlates these 
eight quantities with the tagging efficiencies. 

A muon jet can be tagged as a b jet using either a lifetime tagger, or by requiring that the muon 
has large p r j l . In this analysis, the requirement is pj 1 > 0.8GeV/c. These two tagging criteria 
have efficiencies e^ 8 and e£ tRel , respectively, for b jets. The third tagging criterion is the require- 
ment that another jet in the event passes also a lifetime-based tagger. This last requirement 
defines the "away-tag sample". It enriches the b content of the events, and thus makes it more 
likely that the muon jet is a b jet. Correlations between the efficiencies of the two tagging crite- 
ria are estimated from simulation. As p T j provides less discrimination between jet flavours at 
higher jet energies, the System8 method loses sensitivity for jet pi > 120 GeV/c. 
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With these criteria eight quantities are measured. The four quantities for the muon jets are: 
the total number of muon jets in the sample ft, the number of muon jets that pass the lifetime- 
tagging criterion n tag , the number of muon jets that pass the p r f 1 requirement n PtRel , and the 
number of muon jets that pass both criteria n ta S' PtRel . Likewise, the four quantities for the 
away-tag sample are labelled p, p tag , p ptRel , p ta & PtRel . The away-tag jets are tagged with the 
TCHPL criterion. 

The full muon jet sample, ft, and the away-tag sample, p, are each composed of an unknown 
mix of b and non-b jets. The non-b jets are labelled "c£". The muon sample thus comprises 
ti\, and n c £, and the away-tag sample, p\, and p c ^. The efficiencies of the two tagging criteria 
on b jets (e^ 8 , £^ tRel ) and on non-b jets (e^ 8 , £g} Rel ) are also unknown, for a total of eight un- 
known quantities. Thus, a system of eight equations can be written that relates the measurable 
quantities to the unknowns: 

n = n h + n c£ , 
V = Vb + Pee / 

n tag = ^ + f g ncgt 

p tag = i gtag e teg pb + a tag £ tag^ / (3) 

n PtRel = £ PtRel nb + £ PtRel nrf/ 

p PtRel = ^Pa M£ P tMpb + a PKeI e PtRel prf/ 

n tag, PTrel = ^ e tag e PtRel nb + a , £ tag £ PtRel nc ^ 

p tag /PTrel = ^ Pe tag e P t Rel pb + aP£ tag £ PtRel prf _ 

The method assumes that the efficiencies for a combination of tagging criteria are factorizable. 
Thus eight correlation factors are introduced to solve the system of equations: a tag , /3 tag , a PtRel , 
)S ptRel , a", p n , «?, and /3P. These factors are obtained from the simulation as a function of the 
muon jet pj and \rj\. The factors a and /3 are determined for non-b and b jets, respectively. The 
superscripts "tag" and "PtRel" of a. and /3 indicate the efficiency ratio of the p to the n samples 
for the lifetime and p^ 1 criteria. The superscripts "n" and "p" refer to the correlation between 
the two tagging efficiencies, "tag" and "PtRel", in the n and p samples. 

The simulation predicts that the correlation coefficients typically range between 0.95 and 1.05 
for those associated with the b-jet tagging efficiencies, and between 0.7 and 1.2 for those associ- 
ated with the c+£-tagging efficiencies. A numerical computation is applied to solve the system 
of eight equations in the data to determine the eight unknowns, thus simultaneously determin- 
ing the tagging efficiencies and flavour contents of both the full and away-tag samples. 

5.3 Efficiency measurement using a reference lifetime algorithm 

While muon pip 1 provides less discrimination power between jet flavours at large jet pj, the 



lifetime-based algorithms described in Sections [12] and [43] (TCHE, TCHP, JP, JBP, SSVHE, 
SSVHP and CSV) retain their sensitivity to distinguish different jet flavours. In particular, 
the discriminant for the jet probability algorithm has different distributions for different jet 
flavours for jet momenta in the range 30 < pj < 700 GeV/ c. The JP algorithm can be calibrated 
directly with data. Tracks with negative impact parameter are used to compute the probability 
that those tracks come from the primary vertex. The same calibration is performed separately 
in simulated samples. As a result, the JP algorithm serves as a reference for estimating the frac- 
tion of b jets in a data sample, and also for estimating the fraction of b jets in a subsample that 
has been selected by an independent tagging algorithm. In this manner the efficiency of the 
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independent algorithm can be measured. This method is called the lifetime tagging method 
("LT"). It can be performed on both inclusive and muon jet samples. The resulting scale factors 
are compared to obtain an estimate of the systematic uncertainty. 

The efficiency measurement is performed in inclusive jet events in which at least one jet must 
be above a given pj threshold, and separately in dijet events in which at least one jet is a muon 
jet. To increase the fraction of b jets in the inclusive sample, an additional jet tagged by the JPM 
algorithm is also required. The sample with muon jets is already sufficiently enriched in b jets 
by the muon requirement. The same set of samples can be established with simulated events, 
so that the true tagging efficiency can be measured there and a scale factor computed. 

Because a value of the JP discriminant can be defined for jets that have as few as one track with 
a positive impact parameter significance, the discriminant can be calculated for most b jets, 
regardless of their pj. The fraction of b jets that have JP information, Q,, rises from about 0.91 
at p T = 20 GeV/c to more than 0.98 for p T > 50 GeV/c. 



Figure 12 shows the JP discriminant distributions in the muon jet sample and the inclusive 
sample, before and after tagging the jets with an independent tagger, in this case the CSVM 
tagger. Also shown is a fit to the distributions using JP-discriminant templates derived from 
simulations of b, c, and light-parton jets. The normalization of the relative flavour fractions fb, 
f c and /light is left free, with the constraint that /b + /c + /light = 1- The b-jet tagging efficiency 
is the ratio of the number of b jets that are tagged by the independent tagger to the number of 
b jets before the tagging. The numbers are calculated using the fit. The b-jet tagging efficiency 
is corrected for the fraction of jets that have JP information. 

r f ta s \j ta s 

£ ta g _ *-b ' Jb ' JV data m 

b -bpfnrp tncr _ ^bpfnrp tacr 7 ^ ' 



.-before tag , ^before tag ' 
Jb ' iV data 

where the superscripts "before tag" and "tag" refer to the samples before and after application 
of the tagging criterion. 



Examples of the efficiencies measured for the JPL and CSVM taggers are shown in Fig. 13 In 
both cases the results from simulation are close to those obtained from data. 

This technique cannot be used to measure the efficiency of the JP algorithm itself, as the JP 
discriminant is used in the fit to determine the b-jet content of the sample. However, the CSV 
discriminant, which is mostly based on information from secondary vertices, can be used in 
its place to determine the flavour content. More than 90% of jets have CSV information, as is 
the case with the JP discriminant. But unlike the JP discriminant, the CSV discriminant cannot 
be calibrated solely with the data. To remedy this, the CSV discriminant is used to estimate 
the tagging efficiency of the TC algorithms. By comparing these results to those using the JP 
discriminant, the bias due to using the CSV discriminant is determined to be (0-2%, 4-6%, 6- 
9%) for the (loose, medium, tight) operating points. The efficiencies and scale factors for the JP 
algorithm are corrected for these biases. 

5.4 Systematic uncertainties on efficiency measurements 

Several systematic uncertainties affect the measurement of the b-jet tagging efficiency. Some 
are common to all four methods (PtRel, IP3D, System8, LT), some are common to a subset of 
them, and some are unique to a particular method. 

Common systematic uncertainties for all methods: 



5.4 Systematic uncertainties on efficiency measurements 



17 
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Figure 12: Fits of the summed b, c and light-parton templates, for simulated jets, to the JP- 
discriminant distributions from data, (a) and (c) show the results for muon jets before and after 
identification with the CSVM tagger, respectively, (b) and (d), the equivalent plots for inclusive 
jets. The jet pj is in the range 260 < pj < 320 GeV/ c. Overflows are displayed in the rightmost 
bins. 
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Figure 13: Efficiencies for the identification of b-jets measured for (a) the JPL and (b) the CSVM 
tagger with the LT method in the muon jet sample. Filled and open circles correspond to data 
and simulation, respectively. 



• Pileup: The measured b-jet tagging efficiency depends on the number of pp colli- 
sions superimposed on the primary interaction of interest. The systematic uncer- 
tainty is computed by varying the average value of the pileup in data by 10% and 
calculating the difference in the values of Sfb after reweighting the simulation with 
the modified distribution. 

• Gluon splitting: Studies of angular correlation between b jets at the LHC |40| indi- 
cate that QCD events may have a larger fraction of gluon splitting into bb pairs than 
is assumed in the generation of the simulation. A study was carried out with the MC 
sample where the number of events with gluon splitting was artificially changed 
by 50%. Results obtained with this modified gluon splitting MC sample are then 
compared to those with the original sample. The observed deviation is quoted as a 
systematic uncertainty. 

• Muon pj-. The central value of the b-jet tagging efficiency is extracted from data 
with muon pj>5 GeV/c. The choice of the selection affects the shape of the template 
distributions used in fits, and also the number of events used to measure the tagging 
efficiencies. The p\ threshold is varied up to 9GeV/c to test the sensitivity to this 
choice. 

Common uncertainty for the PtRel, IP3D and System8 methods: 

• Away-jet tagger: The dependency of the calculated b-jet tagging efficiency on the 
away-jet tagger is studied by comparing the results obtained by tagging the away 
jets with different variants of the TC algorithm (TCHEL, TCHEM, TCHPM). The 
measured SI\, tends to increase when the away tag is tighter. The maximum devia- 
tion from the default away-jet tagger is taken as a systematic uncertainty. 
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Uncertainty unique to the PtRel method: 

• Ratio of light-parton to charm jets in simulation: The shapes of the pj 1 and IP 

spectra for light-parton jets have been obtained from control samples in data, which 
minimizes the bias due to a mismodelling of the muon kinematics in the simula- 
tion. However, since the pj 1 distribution in data is fitted with a sum of templates 
for b jets and for c+udsg jets, uncertainties on the ratio between light-parton and 
charmed jets in the simulation must be considered. To do so, the predicted ratio 
is varied by ±20%, and the fit is repeated, taking the variation in the results as a 
systematic uncertainty. This uncertainty does not apply to the IP method, where a 
three-component fit is performed that determines the light-parton and charm con- 
tributions independently. 

Uncertainties unique to the System8 method: 

• Selection on pf 1 : One of the System8 criteria is a selection on the muon pf 1 > 
0.8 GeV/c. In order to test the sensitivity to the b purity in the muon jet sample and 
the relative charm /light-parton fraction in the non-b background, this selection was 
changed from 0.5 to 1.2GeV/c in the data. The correlation factors were recomputed 
accordingly in the simulation and the System8 method was applied again to the data 
in order to compute the b-jet tagging efficiency. The largest deviation observed from 
the central value is quoted as a systematic uncertainty. 

• MC closure test: The b-jet tagging efficiency can be directly calculated from the 
simulated QCD muon-enriched sample, as the flavour of the jets at generator level 
is known. In this case, the efficiency can be measured by taking the number of 
identified true b jets over all true b jets. The resulting value is denoted as the MC 
truth b-jet tagging efficiency. The System8 method is also applied to this MC sample. 
The resulting b-jet tagging efficiencies are in good agreement with the MC truth, 
giving a negligible systematic uncertainty. (This systematic uncertainty does not 
appear for the other methods as they rely on template fits, making such a test trivial.) 

Uncertainties unique to the LT method: 

• Fraction of b jets with JP information: The fraction of inclusive jets with JP infor- 
mation is well described by the simulation. As explained above, the number of b 
jets before tagging is measured by a fit to the JP distribution and corrected by the 
fraction Q, of b jets with JP information. A systematic uncertainty of half the resid- 
ual correction, (1 — Q,)/ (2Q,), is estimated from the simulation as a function of the 
b-jet pj. A corresponding factor with a similar uncertainty is needed for measuring 
the efficiency of the JP and JBP taggers with the CSV discriminator spectrum. 

• Difference between muon jets and inclusive jets: The difference between the mea- 
sured SFt, in muon jets and in inclusive b jets is taken as a systematic uncertainty. 
This is the largest contribution to the systematic uncertainty on SF\> with the LT 
method. Due to the large statistical uncertainty on Sfb for inclusive jets with pj < 
80GeV/c, the same systematic uncertainty is used for pj < 80GeV/c and for the 
range 80-210 GeV/c. If the difference on Sfb between muon jets and inclusive jets is 
smaller than the statistical error on Sfb for inclusive jets, this uncertainty is used for 
the systematic uncertainty estimate. 

• Bias for the JP and JBP taggers: The uncertainty on the measurement of the bias, 
when using the CSV discriminant to measure the efficiency of the JP and JBP tag- 
gers as estimated for the TC taggers, is propagated into the uncertainty on the scale 
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factors for these taggers. 

The systematic uncertainties on the data/MC scale factors for different tagging criteria are de- 
tailed in Tables [ljji] for the PtRel and System8 methods at low jet p T (80 < p T < 120GeV/c) 
and for the IP3D and LT methods at higher jet pj (160 < pj < 320GeV/c). In these momen- 
tum ranges the average uncertainty is about 3% for the PtRel method, 6-10% for the System8 
method, 3-4% for the IP3D method, and 2-7% for the LT method. 

Table 1: Relative systematic uncertainties on SF\, for the PtRel method in the muon jet pj range 
80-120 GeV/c, using the medium operating point of the b-jet tagging algorithms. 



b tagger 


pileup 


g ^bb 


Pt 


away jet 


light / charm 


total 


JPM 


1.6% 


1.6% 


1.1% 


1.9% 


0.4% 


3.2% 


JBPM 


0.9% 


0.5% 


1.8% 


1.2% 


0.5% 


2.5% 


TCHEM 


1.5% 


0.5% 


1.4% 


1.6% 


0.4% 


2.7% 


TCHPM 


1.1% 


0.1% 


2.3% 


1.1% 


0.6% 


2.8% 


SSVHEM 


0.7% 


1.3% 


2.0% 


0.4% 


0.4% 


2.5% 


CSVM 


1.4% 


1.3% 


1.3% 


1.2% 


0.5% 


2.6% 



Table 2: Relative systematic uncertainties on SF^ for the System8 method in the muon jet pj 
range 80-120 GeV/c, using the medium operating point of the b-jet tagging algorithms. 



b tagger 


pileup 


g^bb 


t 1 
Pt 


away jet 


pf 


MC closure 


total 


JPM 


1.4% 


0.6% 


4.2% 


3.9% 


1.6% 


0.1% 


6.1% 


JBPM 


1.5% 


1.9% 


6.5% 


1.5% 


4.0% 


<0.1% 


8.2% 


TCHEM 


1.3% 


1.3% 


6.6% 


2.1% 


2.4% 


<0.1% 


7.5% 


TCHPM 


1.3% 


2.7% 


8.2% 


1.9% 


4.0% 


0.1% 


9.7% 


SSVHEM 


1.3% 


0.1% 


3.7% 


2.8% 


3.0% 


<0.1% 


5.6% 


CSVM 


1.5% 


0.4% 


4.3% 


1.3% 


4.5% 


0.1% 


6.5% 



Table 3: Relative systematic uncertainties on Sir, for the IP3D method in the muon jet pj range 
160-320 GeV/c, using the medium operating point of the b-jet tagging algorithms. 



b tagger 


pileup 


g^bb 


¥ 

Pt 


away jet 


total 


JPM 


0.1% 


0.7% 


0.1% 


3.2% 


3.2% 


JBPM 


0.2% 


1.7% 


0.4% 


2.8% 


3.3% 


TCHEM 


0.1% 


1.2% 


0.4% 


2.4% 


2.7% 


TCHPM 


0.2% 


2.3% 


0.7% 


2.2% 


3.3% 


SSVHEM 


0.6% 


2.2% 


0.3% 


2.9% 


3.6% 


CSVM 


0.6% 


2.3% 


0.1% 


3.2% 


4.0% 



6 Efficiency measurement with tt events 

In the framework of the standard model, the top quark is expected to decay to a W boson and 
a b quark about 99.8% of the time [23j. Experimentally, the measurement of the heavy-flavour 
content of tt events can provide either a direct measurement of the branching fraction of the 
decay of the top quark to a W boson and a b quark, B(t — > Wb), or, assuming B(t — > Wb) = 1, 
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Table 4: Relative systematic uncertainty on St\, with the LT method in the muon jet pj range 
160-320 GeV/c, using the medium operating point of the b-jet tagging algorithms. 



b tagger 


pileup 


g^bb 


Pt 


c b 


inc. jets 


bias 


total 


JPM 


0.1% 


0.8% 


0.5% 


0.1% 


4.4% 


4.0% 


6.0% 


JBPM 


0.1% 


0.4% 


0.8% 


0.1% 


4.3% 


4.0% 


5.9% 


TCHEM 


0.1% 


1.6% 


0.3% 


0.2% 


2.8% 




3.2% 


TCHPM 


0.2% 


0.5% 


0.5% 


0.2% 


1.7% 




1.9% 


SSVHEM 


0.1% 


2.3% 


0.8% 


0.2% 


6.6% 




7.0% 


CSVM 


0.2% 


2.3% 


0.7% 


0.2% 


5.2% 




5.7% 



the b-jet tagging efficiency. The b jets in tt events have an average pj of about 80GeV/c and 
cover a pj range relevant for many processes both within the standard model and for many 
models beyond the standard model. 

In this Section, we present several methods to study the heavy-flavour content of tt events. The 
profile likelihood ratio (PLR) method, described in Section |6.3[ and the flavour tag matching 



(FTM) method, described in Section 6.5 use tt events in the dilepton channel in which both W 



bosons decay into leptons. The flavour tag consistency (FTC) method, described in Section 6.4 



and the bSample method (Section 6.6 1 use tt events in the lepton+jets channel, in which one 
W boson decays into quarks and the other into a charged lepton and a neutrino. These meth- 
ods are used to measure the efficiency of tagging b jets to be measured in the data and the 
simulation over the average pj and t] range of jets in the top-quark events. The differences in 
efficiencies observed between the data and MC simulation are provided as a data/MC scale 
factor SF;, similar to the techniques described in Section|5] 

6.1 Event selection 

The event reconstruction used herein follows closely the event selection performed for the tt 
production cross section measurements l|22ll4T| , with the exception of the b-jet tagging require- 
ments. All objects are reconstructed using a particle-flow algorithm. 

In the lepton+jets channel, the final state is composed of four jets, one energetic isolated muon 
and missing transverse energy. Events are required to pass a single-muon trigger. After offline 
reconstruction, events are selected requiring exactly one isolated muon with pj > 30GeV/c 
and \tj | < 2.1 and at least four jets with pj > 30GeV/c and \rj\ < 2.4. The FTC method further 
requires that the two leading jets have transverse momenta greater than 70 GeV/c and 50 GeV/c 
respectively, and that the transverse momentum of the muon is greater than 35GeV/c. The 
reconstructed missing transverse energy ($x) is required to be above 20 GeV. 

In the dilepton channel, the final state is composed of two jets, two energetic isolated leptons 
(electron or muon) and missing transverse energy. Events are required to pass dilepton triggers 
in which two muons, two electrons, or one electron and one muon are required to be present. 
After offline reconstruction, events are selected with two isolated, oppositely charged leptons 
with pj > 20GeV/c and \t]\ < 2.5 (2.4) for electrons (muons), at least two jets with pj > 
30GeV/c and \t]\ < 2.4, and #j > 30 GeV for ee/ji}i events. The selected leptons and jets 
are required to originate from the same primary interaction vertex. Events with same-flavour 
lepton pairs in the dilepton mass window (76 < niu < 106GeV/c 2 ) are removed to suppress 
the dominant Z+jet background. Dilepton pairs from heavy-flavour resonances and low-mass 
Drell-Yan production are also removed by requiring a minimum dilepton invariant mass of 
12GeV/c 2 . 
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Table 5: Number of observed and predicted events in the lepton+jets sample after applying all 
selection requirements of the FTC method. All MC samples have been scaled to an integrated 
luminosity of 2.3 fb -1 . The uncertainties include the uncertainties on the luminosity and the 
cross sections. The CSVM operating point has been used for the b-jet tagging requirement. 





no tagging 


> 1 b-tagged jets 


> 2 b-tagged jets 


tt 


8504 ± 1275 


7425 ± 1113 


3744 ± 561 


Single top 


477 ± 82 


394 ± 118 


162 ± 49 


W+jets 


6170 ± 1851 


1367 ± 410 


214 ± 64 


Z+jets 


459 ± 138 


83 ±25 


15 ±5 


QCD 


23 ± 7 


3 ± 1 


0.20 ± 0.06 


Total prediction 


15633 ± 2253 


9272 ± 1921 


4134 ± 566 


Data 


14391 


8781 


3897 



The numbers of observed and predicted events in the lepton+jets channel and the dilepton 
channel are given in Tables [5] and |6j respectively. The uncertainties include the uncertainties 
on the luminosity measurement and the cross sections. For all MC predictions, events are 
reweighted to take into account differences in trigger and lepton selection efficiencies between 
data and simulation [221 W\\ . The lepton selection efficiency scale factors are estimated from 
data using Z events. For dilepton events, the trigger efficiencies are estimated on a data sample 
using a trigger that is weakly correlated to the dilepton triggers. The dilepton trigger selection 
efficiency is estimated on events which contain two leptons that fulfil the complete dilepton 
event selection. 

The Drell-Yan background is measured using data. Two different methods are used, and the 
two estimates are compatible. In the PLR method, for the ee and yty. channels, the ratio of 
Drell-Yan events outside and inside the dilepton invariant mass window, R ut/iiv is estimated 
from the simulation. This is used to estimate the Drell-Yan background using the number of 
data events inside the dilepton invariant mass window |4l]. A contamination from other back- 
grounds can still be present in the Z-mass window, and this contribution is subtracted using the 
ep channel scaled according to the event yields in the ee and jip channels. For the e}i channel, 
the DY background yield is estimated after performing a binned maximum-likelihood fit to 
the dilepton invariant mass distribution. In the FTM method, the number of Drell-Yan events 
ee and channels is estimated from the shape of the distribution of the angle between the 
momentum of the two leptons. For the eji channel, the predictions are taken from simulation. 

6.2 Systematic uncertainties 

Most of the sources of systematic uncertainties are common to all methods, and several meth- 
ods have specific additional contributions. A description of the common systematic uncer- 
tainties is given in this section. The description of the procedure to estimate the systematic 
uncertainties in each analysis and the influence of the different sources will be given separately 
for each analysis in its relevant section. 

There are different sources of uncertainties originating from detector knowledge or related to 
the theory and the simulation. These uncertainties can affect the normalization factor for each 
process or they can distort the distributions themselves. 

The dominant sources of uncertainty arise from the MC simulation. The uncertainty due to the 
modelling of the underlying event is estimated by comparing results between the main sample 
generated with the Z2 tune to that with the D6T tune ||42| . The effect due to the scale used to 
match clustered jets to partons (i.e., jet-parton matching) is estimated with dedicated samples 
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Table 6: Number of observed and predicted events in the dilepton sample after applying all 
selection requirements of the PLR method. All MC samples have been scaled to a luminosity 
of 2.3 fb _1 . The uncertainties include the uncertainties on the luminosity and the cross sections. 
The TCHEL operating point has been used for the b-jet tagging requirement. The component 
"tt signal" stands for the dilepton events. The component "tt other" contains the events in all 
other decay channels. 

Processes Channel ee Channel jifi Channel eji 



Without b-jet tagging requirement 



tt signal 


971 ± 147 


1275 ± 182 


3453 ± 521 


tt other 


11.5 ± 5.7 


3.3 ± 1.7 


23.6 ± 11.8 


Single top 


48.7 ± 14.6 


62.7 ± 18.9 


163.7 ± 49.0 


Di-bosons 


22.3 ± 6.7 


29.2 ± 8.8 


49.4 ± 14.8 


Z+jets 


409 ± 204 


545 ± 273 


200 ± 100 


W+jets 


12.0 ± 6.0 


< 0.5 


11.4 ± 5.7 


Total prediction 


1475 ± 259 


1915 ± 343 


3902 ± 512 


Data 


1442 


1773 


3898 


With > 1 b-tagged jets 


Total prediction 


1088 ± 170 


1429 ± 218 


3390 ± 475 


Data 


1080 


1364 


3375 


With > 2 b-tagged jets 


Total prediction 


529 ± 73 


697 ± 97 


1827 ± 263 


Data 


554 


686 


1854 



generated by varying the nominal matching pj thresholds by factors of 2 and 1/2. Effects due to 
the definition of the renormalization and factorization scales used in the simulation of the signal 
are studied with dedicated MC samples with the scales varied simultaneously by factors of 2 
and 1/2. The uncertainties related to the parton distribution function (PDF) used to model the 
hard scattering of the proton-proton collisions are estimated by varying the parameters of the 
PDF by Her with respect to their nominal values and using the PDF4LHC prescription |fT3llT5| . 
Variations in the relative composition of the simulated samples are studied by varying the 
contributions of each background with respect to the signal and each other. 

Several systematic uncertainties pertain to the modelling of the CMS detector in the MC sim- 
ulations. Important uncertainties are the energy scales of the jets and, to a lesser extent, of 
the leptons, as they shift the momenta of the reconstructed objects. Similarly, the uncertainty 
in jet energy resolution has also been considered. The effects of the jet energy scale are taken 
into account by varying the energy scale of the jets according to its uncertainty fl28| . A further 
source comes from the uncertainties associated with the measurement of the trigger and lepton 
selection efficiencies. The uncertainty due to pileup is evaluated by varying the mean value of 
the measured pileup distribution by ±10%. 

6.3 Profile likelihood ratio method 

In this method, the data/MC scale factor of the b-jet tagging efficiency is measured with the 
PLR method using the 2-dimensional distribution of the jet multiplicity versus the b-tagged 
jet multiplicity in dilepton events. The uncertainties in the event yield and in the shape of 
the distribution are considered as nuisance parameters in the likelihood function and are then 
fitted during the minimization procedure. This leads to combined statistical and systematic 
uncertainties associated with the measurement of the scale factor. 
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CMS, 2.3 fb" 1 at is = 7 TeV 




Figure 14: Number of b-tagged jets per event in the dilepton channel for the TCHEL operat- 
ing point, in the data (filled circles) and the simulation (solid histograms), before the fit. The 
simulated distribution is normalized to an integrated luminosity of 2.3 fb -1 . The bin labels 
(m,n) refer to the number of events with m jets in the event of which n are tagged. The compo- 
nent "tt signal" are the dilepton events, and the component "tt other" contains the events in all 
other decay channels. The hatched area corresponds to the combined statistical and systematic 
uncertainty. 

The likelihood function for a given dilepton channel j (ee, eji or and a given bin i of the 
2-dimensional distribution (corresponding to n jets and m b-tagged jets) is written as |43|: 

£ !V (Sfb,N<f ,{14}) = VotssoniNf^i^S^iUk})) x Y[Gauss(U kf 0,1) , (5) 

k 

where Nf? s is the number of observed events, yi^ the number of expected events, and 14 the 
nuisance parameters. The distribution of the number of b-tagged jets observed in data and 
predicted in the simulation for the TCHEL operating point for tt and background events is 



shown in Fig. 14 The likelihood function for a given channel j is then the product of the 



likelihood functions over all the bins of the distribution: 

^•(Sfuiv;* 8 },{i4}) = ]Jc i , j (sF bf N^ s ,{u k }) . (6) 

i 

Since the decay channels are statistically independent, the overall likelihood function is then 
simply the product of the individual channel likelihoods: 

C(SF b ,{N$ s },{U k })=Yl£j. (7) 



The expression of the profile likelihood ratio LR is then 



6.4 Flavour tag consistency method 
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where ii, represents the conditional maximum likelihood estimates of IT, obtained with the 
scale factor Sfb fixed while Sfb and Uj are the estimates obtained with Sfb free. 

The distribution of — 2 ln(LR(Sfb)) is asymptotically distributed as a x 2 distribution with one 
degree of freedom (Wilk's theorem |44]). An LR curve is obtained by scanning the values of 
Sfb in a given range and used to determine a 68% confidence level interval. These uncertainties 
are the combination of the statistical uncertainty and the systematic uncertainties considered 
as nuisance parameters. All the nuisance parameters are common to the three channels except 
the estimation of the backgrounds from data for W+jets and Z+jets. The Z+jets background 



is estimated from data as described in Section 6.1 The small W+jets background is estimated 
from data using the matrix method 

The expected number of b-tagged jets in events with n jets of a given dilepton final state, y.\ in 
Eq.([5]|, is derived from pre-tagged simulated events with n jets. This is carried out by applying 
per-jet b-jet tagging efficiencies, considering all jet tagging combinations. These efficiencies are 
derived as a function of px and rj, using simulated tt events for b jets and using data samples 
dominated by light-parton jets. A constant scale factor Sfb is applied to the b- and c-jet ef- 
ficiencies to model the b-jet tagging efficiency in data. The value of Sfb is then extracted by 
minimizing the LR as described above. A closure test is performed on simulated signal events 
to check that, for a unit scale factor, the b-tagged jet multiplicity distribution obtained with the 
reweighting procedure is the same as the one obtained directly from MC simulation using a 
requirement on the b-jet tagging discriminant. 

Several uncertainties are considered as nuisance parameters in the likelihood function and are 
then fitted during the minimization procedure. These are the uncertainties on the energy scale 
of the jets and the leptons, the expected number of events of the different contributions, and 
the uncertainty on the light-parton jet scale factor. 

Further contributions to the systematic uncertainties are estimated outside the PLR procedure. 
The expected input distributions to the PLR method are re-derived, using MC samples with 
varied parameters, and the b-jet tagging scale factors are re-measured. The relative differences 
of Sfb with respect to the nominal values are taken as systematic uncertainties, and added in 
quadrature to the total uncertainty from the fit. These uncertainties include the uncertainties on 
the jet-parton matching scale, the parton-shower/ matrix-element threshold, and the top mass. 

The factorization scale is the dominant systematic uncertainty, as it affects the jet multiplicity 
distribution, with a relative uncertainty of approximately 1.7% on the scale factor of the CSVL 
operating point. The second-largest contribution is from the uncertainty on the tt event yield, 
which is estimated to be 20%. It includes the uncertainties on the tt cross section, the trigger 
and lepton selection efficiencies, and the branching fraction of the decays of the W bosons. 
This results in an uncertainty of 1.4% on the scale factor of the CSVL operating point. Further, 
the statistical uncertainty on the b-jet tagging efficiency in the simulation was found to range 
between 0.4% and 1.6% depending on the operating point considered. A 1.6% systematic un- 
certainty on the scale factor was therefore chosen for all the operating points. 

Finally, to account for a possible uncertainty coming from the fitting algorithm itself, an addi- 
tional uncertainty is estimated using different choices of the likelihood minimization. This is 
taken as a 1% relative uncertainty. 

6.4 Flavour tag consistency method 

The FTC method requires consistency between the observed and expected number of tags in 
the lepton+jets events to study the performance of the heavy-flavour algorithms. 
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In a sample of tt pair candidates in the lepton+jets channel, the expected number of events with 
n b-tagged jets (N„) can be written as 

(N H ) = L • <r ff • e ■ £ % ' " ' £ * "Vf 4(1 - B b )^C^i(l - e c )(H') C f e f (1 - £l f- k % (9) 

i'+f+k'=n 

where L is the integrated luminosity, cr^ is the tt cross section, e is the pre-tagging selection 
efficiency, C h a is the binomial coefficient, and £ b , e C/ and ej are the b-, c-, and light-parton jet 
tagging efficiencies. The factors Fm are the fractions of events with i b jets, c jets, and k light- 
parton jets. They are derived from the tt simulation in which the true flavour of the jets is 
known. 

As an example, the Fxn term contributes to the expected number of events with 1 b-tagged jet 
(Ni) in the following way: 



(Ni) o< Fi 12 x 



/ \ 

1 • £b (l - e c )(l - £l ) 2 + 1 • (1 - e b )e c (l - £i) 2 + 2 • (1 - e b )(l - e c ) £l (l - El ) 

V v ' S v ' V v ' 

\ the b jet the c jet the light-parton jet J 



(10) 



To account for the non-negligible amount of background, Eq. ([9]) is modified to include each 
background sample: 

( Nn ) = (N«) + ( N background ) 

i'<i,j'<j,k'<k 

t (■••) 

i,j,k i'+j'+k'=n 

rr e i' <i,f <j,V <k 

^background tbackground r-i _background ST I \ 

+ ~ — U b i)k U \'") 

"tt fc tt j^jc i'+/'+fc / =n 

where (• • • ) stands for the expression in square brackets from Eq. (|9). 



(11) 



The tagging efficiencies and the tt production cross section are then measured by minimizing 
the log-likelihood function: 

C = -2\ogYlVoisson{N n , (N n )), (12) 

n 

where N n is the number of observed events with n b-tagged jets. The distribution of the number 
of b-tagged jets observed in data and predicted in the simulation before and after the fit for tt 
and background events is shown in Fig. [l5j 

In the current implementation the likelihood only uses the b-tagged jet multiplicity in tt lep- 
ton+jets events with between four to seven reconstructed jets, as it emphasizes the measure- 
ment of the heavy-flavour b-jet tagging efficiency. The b-jet tagging efficiencies and tt cross 
section are treated as free parameters in the fit. The tt cross section determined in the fits are 
consistent with the published values. The c- and light-parton-jet tagging efficiencies are taken 
from the simulation corrected for the data/MC scale factors. 

The systematic uncertainties are determined from ensembles of pseudo-experiments. In each 
of these pseudo-experiments, the number of signal and background events are generated using 
Poisson statistics, using as mean values the number of expected events in each channel. Events 
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CMS, 2.3 f b 1 at is = 7 TeV 




2 3 4 

Number of Tagged Jets 



Number of Tagged Jets 



Figure 15: Number of tagged jets per event in the lepton+jet channel with the FTC method with 
the CSVM operating point, in the data (filled circles) and the simulation (solid histograms), (a) 
before and (b) after the fit. The simulated distribution is normalized to an integrated lumi- 
nosity of 2.3 fb -1 . The hatched area corresponds to the combined statistical and systematic 
uncertainty. 

are then randomly chosen in the simulated samples and the b-tagged jet multiplicity distribu- 
tions are populated according to the simulated jet multiplicity in each event. The measurement 
is then performed as described above using the factors from the nominal simulation. The 
average b-jet tagging efficiency is compared to the average b-jet tagging efficiency value mea- 
sured in ensemble tests with the nominal samples. The difference is taken as a systematic 
uncertainty. 

The dominant contribution is the uncertainty on the jet energy scale, with a relative uncertainty 
of 2.2% on the scale factor of the CSVL operating point. The second-largest uncertainty arises 
from the uncertainty on the production cross section of the W+heavy flavour jets, with a rel- 
ative uncertainty of 0.97%. The uncertainties due to the factorization scale and the jet-parton 
matching are 0.41% and 0.35%, respectively, for the CSVL operating point. 

6.5 Flavour tag matching method 

The FTM method requires consistency between the observed and expected number of tags in 
dilepton events. The expected number of events with n b-tagged jets (N n ) is written as 



all jets 

(N n ) = £ n k ■ P„ /k , 

k jets=2 



(13) 



where n k is the observed number of events with k jets, and P n ^ is the probability to count n 
b-tagged jets in a /c-jet event. These probability functions are written in terms of the tagging 
efficiencies and the expected jet composition. 

In order to illustrate explicitly the construction of the probability functions, the exclusive two- 
jet multiplicity bin is used and the following expression is obtained: 



2 

\l = X] a <" P n,2,i , 
i jets=0 
from top decay 



(14) 



where P n ,2,i is the probability that n b tags are observed in an event with two jets of which i jets 
come from tt decays. 
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The misassignment probabilities a, denote the probability in the sample that i jets from the 
decay of the tt pair have been reconstructed and selected. These are normalized such that 
DC, = 1. For example, di is the probability that both b jets from the tt decay have been 
selected. They take into account both the contribution from the background, which is small in 
the dilepton channel, and jet misassignment. Either or both of the jets from the decays of the 
two top quarks may not be selected, and jets from initial- and final- state radiation, or jets from 
the proton recoil may enter the selection, further diluting the sample. 

As an example, for the case where two tagged jets are found in a two-jet event, the probabilities 
can be explicitly written as: 

^2,2,0 = £q if n o jets are from tt decays; 

^2,2,1 = 2£b e q if 1 jet is from tt decays; (15) 

^2,2,2 = e b ^ ^ jets are from tt decays. 

The misidentification probability £ q is an effective measurement of the probability of tagging 
gluon, light and charm quark jets in the dilepton sample. Similar expressions can easily be 
derived for the other jet multiplicity bins. 

The misassignment probabilities are determined from data, and used in the subsequent likeli- 
hood of the b-tagged jet multiplicity distribution. In order to estimate the actual fraction of b 
jets from top-quark decays in the selected sample, kinematic properties of the top decay topol- 
ogy are used. The invariant mass of the lepton-jet pairs from a t -¥ Wb decay have a kinematic 

end-point at Mf£ x = \Jm^ — « 156 GeV/c 2 . The invariant mass of misassigned lepton-jet 
pairs exhibits a longer tail towards high mass values. The shape of the misassigned pairs can 
be modelled by mixing lepton-jet pairs from different events or randomly changing the lepton 
momentum direction. The fraction of jets from t —¥ Wb decays can thus be measured normal- 
izing the spectrum obtained from the combinatorial model to the number of pairs observed in 
the tail (i.e. Mg ib > 180GeV/c 2 ). This is estimated independently for each dilepton channel 
and for each jet-multiplicity bin. The procedure is checked and found to be unbiased from 
MC pseudo-experiments. Taking into account the expected contribution of tt and single-top 
events to the final sample, the sample composition in terms of events with 2, 1, or correctly 
reconstructed and selected b jets is estimated. 

The b-jet tagging efficiency £b can then be measured by maximizing the likelihood function: 

all jets 

C(e b ,e q ,cti) = Y[ Voisson(N„, (N„)) , (16) 

n=0 

where N n is the observed number of events with n b-tagged jets. 

The likelihood only uses the b tagged jet multiplicity in tt dilepton events with two and three 
reconstructed jets. Gaussian constraints are added for the effective c- and light-parton jet tag- 
ging efficiency £ q and the misassignment probabilities: 

all jets 

£= Yl 'Poisson(N n/ (N„)) ■ Y\Qauss(ai,Ki,a ai ) ■ Qauss(e q ,e q ,(r eq ) . (17) 

n=0 i 

The central value £ q and width <7 £q of £ q are determined from the simulation. For the misassign- 
ment probabilities a ; , the central values a, are taken from the measurement described above, 



6.6 Efficiency measurement from a b-enriched jet sample 
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and the width cr K . derived from the uncertainty of the expected contribution of tt and single-top 
events to the final sample. 

The systematic uncertainties affect the measurement of the b-jet tagging probability through 
their effect on the parameters of the fit, namely the measured misassignment probabilities and 
the misidentification probability for non-b jets. The effect on the measured misassignment 
probabilities is determined from ensembles of pseudo-experiments, where, for each source of 
uncertainty, the bias on the probabilities is determined. Most sources of uncertainties such as 
jet energy scale and resolution, and pileup have little effect. This is because the method used to 
derive the misassignment probabilities is based on templates for the lepton-jet invariant mass 
obtained from control samples in data. Other sources, which might affect the contribution from 
top-quark decays and from initial- and final-state radiation jets to the final sample, are evalu- 
ated using samples where the QCD factorisation and renormalisation scales and the jet-parton 
matching scales are varied. In the pseudo-experiments the standard tt sample is substituted by 
each of these samples and the process is repeated. 

This bias is then used to shift the measured misassignment probabilities. The likelihood fit 
of the data is repeated with the modified values. The difference with respect to the nominal 
result is taken as the systematic uncertainty. The same procedure is applied to evaluate the 
uncertainty on the misidentification probability for non-b jets. 

The final uncertainty is dominated by factors which tend to increase the contamination of back- 
ground or alter the jet environment. The main uncertainties are the factorization scale, and to 
a smaller extent, the jet-parton matching with relative uncertainties on the scale factor of 2.3% 
and 1.4% respectively, for the CSVL operating point. The second-largest uncertainty arises from 
the 1.5% uncertainty on the light-parton jet tagging efficiency. 

6.6 Efficiency measurement from a b-enriched jet sample 

In this method, the b-jet tagging efficiency is measured from a sample enriched with b jets 
(bSample) in lepton+jets events. The contamination of this sample due to light-parton jets is 
estimated from data and subtracted. 

In order to select the correct jets originating from the decay of the top quarks, a x 2 is calculated 
for each jet-parton combination based on the masses of the reconstructed W boson m qq and the 
hadronically decaying top quark mb qq : 

2 _ f m bqq-m t \ 2 /m qq -m w x2 



^m + m' <i8) 

The mean masses and widths are obtained from the tt simulation using a Gaussian fit to the 
mass distributions of the combination with the correct jet-to-quark assignment. The mean and 
width of the reconstructed top-quark mass distribution are 172.5 GeV/c 2 and 16.3GeV/c 2 , re- 
spectively. The mean and width of the reconstructed W-boson distribution are 82.9 GeV/c 2 and 
9.5 GeV/c 2 , respectively. Using the four leading jets, with transverse momenta above 30 GeV/c, 
there are 12 combinations to pair the four reconstructed jets with the quarks from tt decay. The 
combination with the lowest x 2 is selected to represent the event topology. The event is rejected 
if the lowest x 2 is above 90. 

A generic b-candidate sample is constructed by taking the jet assigned to the lepton. This 
sample is further subdivided into b-enriched and b-depleted subsamples by using the invariant 
mass of that jet and the muon (called the jet-muon mass, m^j). The distribution of this variable 



is shown in Fig. 16 For the b-enriched subsample, the jet-muon mass is required to be in the 
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6 Efficiency measurement with tt events 



range 80 < m„; < 150GeV/c 2 . For the b-depleted subsample the jet-muon mass is required 
to be in the range 150 < m„; < 250GeV/c 2 . Based on the simulation, the purities of the two 
subsamples are 45% and 16%, respectively. 




400 450 500 

(GeV/c 2 ) 

Figure 16: Distribution of the jet-muon mass in the data (filled circles) and the simulation (solid 
histograms). The simulated distribution is normalized to an integrated luminosity of 2.3 fb -1 . 
The component "tt signal" stands for the lepton+jet events. The component "tt other" contains 
the events in all other decay channels. The hatched area corresponds to the combined statistical 
and systematic uncertainty. Overflow entries are added to the last bin. 



The distribution of the discriminators of the taggers for true b jets, A£ nr , is obtained by subtract- 
ing the discriminator distribution of the b-depleted subsample, A^ epl from the discriminator 
distribution of the b-enriched subsample, A£ nr : 

A£ nr = Ajf r -Fx pl • (19) 

The factor F represents the ratio of the number of non-b jets in the b-enriched and b-depleted 
subsamples. It is measured from a background dominated sample composed mainly of light- 
flavour quark jets. This sample is obtained by using the jets attributed to the decay of the W 
boson and ensuring that they both fail the b-jet tagging requirements of the TCHEM operating 
point. Both jets are used to construct a jet-muon mass distribution, and the same subsamples 
are defined as for the signal sample. The purity of light-flavour quark jets is 92% in the region 
80 < m ft j < 150 GeV/c 2 and 95% in the region 150 < m }lj < 250 GeV/c 2 . To match the shape of 
the jet-muon mass distribution of this background sample to that of the signal sample, the jets in 
the background sample are reweighted according to the (pj, tj) of the signal sample. After this 
reweighting, the two samples have similar jet-muon mass distribution. The factor F is taken as 
the ratio of the number of events in the 80 < m„,- < 150 GeV/c 2 and the 150 < m w < 250 GeV/c 2 
regions in the background sample, and is found to be 1.16 ± 0.02. 

A small correlation between the jet-muon mass and the discriminators has to be corrected for. 
This correlation is attributed to the correlation between the transverse momentum of the jet 
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and the jet-muon mass. This correlation distorts the distribution of the discriminants of the 
b-jet tagging algorithms in the b-depleted subsample with respect to the distribution of the 
non-b jets in the b-enhanced subsample. This effect is corrected by reweighting the jets in the 
b-depleted subsample according to the transverse momentum distribution of the jets in the 
b-enhanced subsample. 

The systematic uncertainties for the b-jet tagging efficiency and the scale factors are the absolute 
differences between the nominal simulation sample and the sample with modified parameters. 
Additionally a systematic uncertainty is assigned based on tests of the method in simulation. 
The tests show no bias in the method with an uncertainty driven by statistical uncertainties 
on available samples. For the CSVL operating point, the relative uncertainty is 3.1%. The jet 
energy scale and resolution have a small contribution from the change in the mean masses 
and widths used for the x 2 - For the CSVL operating point, the relative uncertainties on the 
scale factor are 1.4% and 2.2%, respectively. A small uncertainty of 0.5% is due to the choice 
of the boundaries of the b-depleted region. The high tail of the jet-muon mass distribution 
is composed mainly of background events and wrongly combined jets that do not reflect the 
kinematics of the signal events. The effect of imposing an upper limit on the region is assessed 
by varying the boundary between 200 and 300 GeV/c 2 . 

7 Efficiency measurement results 
7.1 Results from multijet events 

The methods described in Section [5] cover a large range of jet transverse momenta. The PtRel 
and the System8 methods provide precise measurements for the lower part of the spectrum. 
The IP3D and the LT methods have been designed for high jet pj. The measured data/MC 
scale factors are given in Table [7] for jets with low p T from 80 to 120 GeV/c, and in Table g for 
jets with high px, from 160 to 320GeV/c. In these ranges the methods give compatible results 
within the quoted uncertainties. While some of the methods measure the efficiencies and scale 
factors only for muon jets, and not inclusive b jets, simulation studies have shown that the 
difference in tagging efficiencies between the two are only a few percent. We assume that these 
small differences have no significant effect on the scale factors, which are relative data/MC 
measurements. 

The results have been combined to provide the best measurements of the data/MC scale factors 
for 30 < px < 670GeV/c. For each jet px range the most precise results have been used: the 
PtRel and System8 methods for p T < 120GeV/c, the IP3D method for p T > 120GeV/c and the 
LT method for the full momentum range. 

The combination is based on a weighted mean of the scale factors in each jet px bin |45l . How- 
ever, there are a significant number of jets from QCD dijet and multijet events (with at least one 
muon associated to a jet) which are shared between the methods. The shared fraction of jets 
varies with jet pj. Typical values are 10-25% between the LT and PtRel/IP3D methods, 40-50% 
between the PtRel and System8 methods, and 20-50% between the System8 and LT methods. 
This overlap has been taken into account in the combination. 

Several sources of systematic uncertainties are common for all methods: the effects due to 
pileup, gluon splitting, and the selection criteria for muons. The muon PtRel and IP3D methods 
have the same sensitivity to the choice of the away-jet tagger. The corresponding uncertainties 
were assumed to be fully correlated or anticorrelated according to the sign of the variations ob- 
served for the different methods. All other systematic effects are specific to individual methods 
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Table 7: Data/MC scale factors SF^ as measured using the PtRel, System8, and LT methods and 
their combination. Results are given for jet pj between 80 and 120 GeV/c. The first uncertainty 
on Sfi, is statistical and the second is systematic. For the combination the total uncertainty is 
quoted. 

b tagger Sfj, (PtRel) SF b (System8) Sfj, (LT) Sfj, (comb.) 

JPL 0.98 ± 0.01 ± 0.03 1.00 ±0.02 ±0.07 1.00 ± 0.01 ± 0.04 0.99 ± 0.03 

JBPL 0.99 ± 0.01 ± 0.02 0.98 ± 0.02 ± 0.04 1.01 ± 0.01 ± 0.04 0.99 ±0.02 

TCHEL 0.99 ± 0.01 ± 0.02 0.97 ± 0.02 ± 0.05 1.00 ± 0.01 ± 0.01 1.00 ±0.01 

CSVL 1.00 ±0.01 ±0.02 1.01 ±0.02 ±0.06 0.98 ± 0.01 ± 0.02 0.99 ±0.02 

JPM 0.90 ± 0.01 ± 0.03 0.93 ± 0.03 ± 0.06 0.99 ± 0.01 ± 0.05 0.92 ± 0.03 

JBPM 0.92 ± 0.01 ± 0.02 0.96 ± 0.03 ± 0.08 0.99 ± 0.01 ± 0.05 0.91 ±0.03 

TCHEM 0.94 ± 0.01 ± 0.03 0.99 ± 0.03 ± 0.07 0.98 ± 0.01 ± 0.03 0.95 ±0.02 

TCHPM 0.95 ± 0.01 ± 0.03 0.94 ± 0.02 ± 0.09 0.97 ± 0.01 ± 0.02 0.96 ±0.02 

SSVHEM 0.92 ± 0.01 ± 0.02 0.92 ± 0.03 ± 0.05 0.97 ± 0.01 ± 0.02 0.95 ±0.02 

CSVM 0.93 ± 0.01 ± 0.02 0.97 ± 0.03 ± 0.06 0.97 ± 0.01 ± 0.03 0.95 ±0.02 

JPT 0.82 ± 0.01 ± 0.05 0.85 ± 0.03 ± 0.07 0.96 ± 0.01 ± 0.07 0.87 ± 0.05 

JBPT 0.83 ± 0.01 ± 0.06 0.89 ± 0.03 ± 0.11 0.96 ± 0.01 ± 0.08 0.87 ±0.06 

TCHPT 0.87 ±0.01 ±0.05 0.91 ± 0.03 ± 0.10 0.94 ± 0.01 ± 0.04 0.91 ±0.04 

SSVHPT 0.87 ±0.01 ±0.03 0.84 ± 0.03 ± 0.10 0.96 ± 0.01 ± 0.03 0.92 ±0.03 

CSVT 0.86 ± 0.01 ± 0.04 0.92 ± 0.03 ± 0.07 0.94 ± 0.01 ± 0.04 0.90 ±0.03 



and have been treated as uncorrelated. A conservative value for the uncertainty is used if the x 2 
from the fit exceeds the number of degrees of freedom, in which case the uncertainty is scaled 
by the square root of the normalised x 2 - Summaries for the individual and combined scale fac- 



tor measurements for the JPL and the CSVM taggers are shown in Fig. 17 Also shown are the 
parameterizations of the combined scale factor of the form SF\,(pj) = a(l + (1 + 7Pt)- 

Combined values for a low and a high jet pj range are shown in the right hand columns of 
Tables [7] and |HJ respectively. 

7.2 Results from tt events 



The statistical properties of the four methods presented in Sections 6.3 to 6.6 have been studied 
using ensembles of pseudo-experiments based on the expected numbers of signal and back- 
ground events. The distributions of the estimated values and their uncertainties show that the 
methods are unbiased. This is shown by the pull distributions, which have mean values close 
to zero and standard deviations close to one. 

The scale factors SJ-j, = e? ieas /e^ IC measured with the different algorithms are shown in Table g 
using data with an integrated luminosity of 2.3 fb -1 . The scale factors were stable over the 
whole data-taking period and can be applied to the full dataset. The measured b-jet tagging 



efficiencies and scale factors for the CSV algorithm are shown in Fig. 18 



The PLR and FTC methods are used to calculate a combined scale factor for use in analyses, 
by taking the weighted mean of the scale factors from each method. The two methods are 
chosen because each has the smallest uncertainty among the analyses in its respective decay 
channel. By choosing one analysis in the dilepton channel and one in the lepton+jets channel, 
there is no statistical correlation between the two measurements as the samples are mutually 
exclusive. Based on the statistical and systematic uncertainties of the PLR and FTC methods, 
the uncertainty of the resulting scale factor is ±0.03 for all operating points. 
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8 Misidentification probability measurement 



Table 8: Data/MC scale factors SI\, as measured using the IP3D and LT methods and their 
combination. Results are given for jet px between 160 and 320GeV/c. The first uncertainty 
on Sfi, is statistical and the second is systematic. For the combination the total uncertainty is 
quoted. 

b tagger Sfj, (IP3D) Sfj, (LT) Sfj, (combTT 

JPL 0.99 ±0.01 ±0.01 1.00 ±0.01 ±0.06 0.99 ±0.02 

JBPL 1.00 ±0.01 ±0.01 1.00 ±0.01 ±0.05 1.00 ±0.02 

TCHEL 1.00 ±0.01 ±0.02 1.00 ± 0.01 ± 0.02 1.00 ±0.02 

CSVL 0.98 ± 0.02 ± 0.01 0.96 ± 0.01 ± 0.04 0.97 ±0.03 

JPM 0.93 ± 0.02 ± 0.03 0.99 ± 0.01 ± 0.06 0.95 ± 0.04 

JBPM 0.97 ±0.02 ±0.03 0.99 ± 0.01 ± 0.06 0.97 ±0.04 

TCHEM 0.96 ± 0.02 ± 0.03 0.97 ± 0.01 ± 0.03 0.96 ±0.03 

TCHPM 0.97 ±0.02 ±0.03 0.97 ± 0.01 ± 0.02 0.97 ±0.02 

SSVHEM 0.98 ± 0.02 ± 0.04 0.98 ± 0.01 ± 0.07 0.98 ±0.04 

CSVM 0.95 ± 0.02 ± 0.04 0.97 ± 0.01 ± 0.06 0.96 ±0.04 

JPT 0.89 ± 0.02 ± 0.04 0.95 ± 0.01 ± 0.10 0.91 ± 0.05 

JBPT 0.91 ± 0.02 ± 0.03 0.96 ± 0.01 ± 0.11 0.92 ±0.05 

TCHPT 0.89 ± 0.02 ± 0.03 0.94 ± 0.01 ± 0.04 0.92 ±0.04 

SSVHPT 0.92 ± 0.02 ± 0.04 0.96 ± 0.01 ± 0.05 0.94 ±0.04 

CSVT 0.90 ± 0.02 ± 0.07 0.94 ± 0.01 ± 0.09 0.92 ±0.07 



A continuous function for the scale factors is required in physics analyses that use b-jet tagging 
discriminators with multivariate methods. The function is obtained from a linear fit to the 
distribution of the scale factors measured with the FTC method. This is offset vertically to 



match the weighted mean of the medium operating point, as illustrated in Fig. 19 
7.3 Comparison of results 

The px-dependent scale factors measured in dijet and multijet events have been compared by 
reweighted them to match the jet px spectrum observed in tt events. The results are shown in 



Table 10 and are in good agreement with each other. This justifies the assumption that the scale 



factors for the muon jets and inclusive jets are compatible. 

8 Misidentification probability measurement 

The measurement of the misidentification probability for light-parton jets relies on the defi- 
nition of inverted tagging algorithms, selecting non-b jets using the same variables and tech- 
niques as the standard versions. These "negative taggers" can be used in the same way as the 
regular b-jet tagging algorithms both in data and in the simulation. As the negative-tagged 
jets are enriched in light flavours, the misidentification probability can be measured from data, 
with the simulation used to extract a correction factor. 

The misidentification probability is evaluated from tracks with a negative impact parameter or 
from secondary vertices with a negative decay length (see Section [4]). When a negative tagger 
is applied to jets of any flavour, the corresponding tagging efficiency is denoted "negative tag 
rate". The negative and positive b-jet tagging discriminator distributions in data are compared 



with the simulation in Fig. 20 The events are selected by requiring jet triggers with a px thresh- 
old of 30GeV/c, corresponding to an average px over all jets in the events of 44GeV/c. For all 
b-jet tagging algorithms, the data and simulation are found to be in agreement to within about 
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CMS, 2.3 fb" 1 at is = 7 TeV 
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Figure 18: Measured b-jet tagging efficiency as a function of the flavour discriminator threshold 
for the CSV algorithm, measured with the (a) FTC method, (b) FTM method and (c) bSample 
method. The absolute b-jet tagging efficiencies measured from data and predicted from simu- 
lation are shown in the upper histograms of each panel. The scale factors Sfj, are shown in the 
lower histogram, where the blue dashed lines represent the combined statistical and systematic 
uncertainty. The arrows indicate the standard operating points. For the FTC method, the red 
line represents a linear function fitted on the distribution of the scale factors. 
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8 Misidentification probability measurement 



Table 9: The scale factors SI\, as measured using the PLR, FTC, FTM and bSample methods, 
and the weighted mean (WM). The uncertainties are the combined statistical and systematic 
uncertainty. 

b tagger PLR FTC FTM bSample WM 

~JPL 0.96 ±0.03 0.99 ±0.03 0.96 ± 0.03 0.96 ± 0.05 0.97 ± 0.03 

JBPL 0.97 ±0.03 1.00 ±0.05 0.96 ± 0.03 0.97 ± 0.05 0.98 ± 0.03 

TCHEL 0.96 ±0.03 0.94 ± 0.04 0.95 ± 0.03 0.92 ± 0.05 0.95 ± 0.03 

CSVL 1.00 ± 0.03 1.03 ± 0.03 0.99 ± 0.04 0.97 ± 0.05 1.01 ± 0.03 

JPM 0.95 ±0.03 0.95 ±0.04 0.93 ± 0.03 0.94 ± 0.06 0.95 ± 0.03 

JBPM 0.93 ±0.03 0.95 ±0.04 0.91 ± 0.03 0.93 ± 0.06 0.94 ± 0.03 

TCHEM 0.96 ±0.03 0.97 ± 0.04 0.96 ± 0.03 0.93 ± 0.06 0.96 ± 0.03 

TCHPM 0.94 ±0.03 0.93 ± 0.04 0.95 ± 0.04 0.92 ± 0.06 0.93 ± 0.03 

SSVHEM 0.95 ±0.03 0.98 ± 0.04 0.98 ± 0.04 0.95 ± 0.07 0.96 ± 0.03 

CSVM 0.97 ±0.03 0.98 ± 0.04 0.97 ± 0.04 0.95 ± 0.06 0.97 ± 0.03 

JPT 0.90 ±0.03 0.89 ±0.05 0.90 ± 0.03 0.95 ± 0.07 0.90 ± 0.03 

JBPT 0.90 ±0.03 0.88 ±0.05 0.90 ± 0.03 0.90 ± 0.07 0.89 ± 0.03 

TCHPT 0.93 ± 0.03 0.92 ± 0.05 0.94 ± 0.04 0.91 ± 0.07 0.93 ± 0.03 

SSVHPT 0.95 ±0.03 0.95 ± 0.04 0.96 ± 0.04 0.89 ± 0.07 0.95 ± 0.03 

CSVT 0.95 ±0.03 0.97 ± 0.05 0.96 ± 0.03 0.95 ± 0.06 0.96 ± 0.03 
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Figure 19: Scale factors measured with the PLR and FTC methods and weighted mean as a func- 
tion of the discriminator threshold for the CSV algorithm. The black function is derived from 
a fit to the values measured with the FTC method. The red function labelled "Final function" 
corresponds to the same function offset vertically to match the weighted mean of the medium 
operating point. The uncertainties are the combined statistical and systematic uncertainty. 



37 



Table 10: The efficiency scale factors Sft,, and their uncertainties, obtained in multijet and tt 
events for b jets in the expected pj range of tt events. 



b taseer 


Sfjj in multijet events 


SFfo in tt events 


TPT 


n qs + n n? 

\J.yO ZLL U.VJi 


n Q7 _i_ n no 


TRPT 


n no _|_ n no 


n qo _i_ n no 

U.70 ZLL U.UJ 


TPHFT 

1 V_ 1 1 LLL 


n QO _|_ f) f|? 
U.7U IL VJ.UZ. 


n on; _|_ n no 


pc\/T 

v_D V L 


n QQ -i- n n? 


i f|1 -1- n 


JPM 


92 ± 03 

KJ • ^ ^— — 1 — KJ • KJ \S 


95 ± 03 


JBPM 


0.92 ± 0.03 


0.94 ± 0.03 


TCHEM 


0.95 ± 0.03 


0.96 ± 0.03 


TCHPM 


0.94 ± 0.03 


0.93 ± 0.03 


SSVHEM 


0.95 ± 0.03 


0.96 ± 0.03 


CSVM 


0.95 ± 0.03 


0.97 ±0.03 


JPT 


0.87 ± 0.04 


0.90 ± 0.03 


JBPT 


0.87 ±0.05 


0.89 ± 0.03 


TCHPT 


0.91 ± 0.04 


0.93 ± 0.03 


SSVHPT 


0.92 ± 0.03 


0.95 ± 0.03 


CSVT 


0.91 ± 0.03 


0.96 ± 0.03 



±20%. Similar results are found for a sample of events selected by requiring jet triggers with 
a px threshold of 300 GeV/c, in which the average pj is 213 GeV/c. Depending on the prescales 
applied, the data correspond to an integrated luminosity of up to 5.0 fb _1 . 

The misidentification probability is evaluated as: 

£ data d = £ data - R %ht / ( 20 ) 

where eT t is the negative tag rate as measured in jet data, defined as the fraction of jets that are 
negatively tagged. Riight = £ mc ' £ MC a correction factor taken from simulation, is the ratio 
of the misidentification probability for light-parton jets to the negative tag rate for jets of all 
flavours in the simulation. 

The rate £^ ata depends on the numbers of c and b quarks in the negative-tagged jets (which tend 
to decrease Riight)/ on the residual differences between light-flavour quark and gluon jets, the 
number of tracks from other displaced processes (such as Kg and A decays, and interactions 
in the detector material), and mismeasured tracks (which tend to increase Riight)- Due to these 
contributions the simulation predicts ranges of Ru g ht/ for the different algorithms and jet pj 
values, of about 1.1 to 1.4, 1 to 2, and 1 to 4, for the loose, medium, and tight operating points, 
respectively. 

To compare the measured misidentification probability to that predicted by the simulation, a 
scale factor SFi ig h t is defined: 

cr c misid / „misid ("71 \ 

flight - Edata / £ MC • y ll > 

The following systematic effects on the misidentification probability based on negative tags are 
considered: 

• b and c fractions: The fraction of b-flavour jets has been measured in CMS to agree 
with the simulation within a ±20% uncertainty [|46| . A ±20% uncertainty is con- 
servatively estimated for the overall fraction of b and c jets. The b- and c-flavour 
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Figure 20: Signed b-jet tagging discriminators in data (dots) and simulation for light-parton jets 
(blue histogram, with a lighter colour for the negative discriminators), c jets (green histogram), 
and b jets (red histogram) for the (a) TCHE, (b) JP, (c) SSVHE, and (d) CSV algorithms. A jet- 
trigger px threshold of 30GeV/c is required for both data and simulation. The simulation is 
normalized to the number of entries in the data. Underflow and overflow entries are added to 
the first and last bins, respectively. 
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fraction is varied in the QCD multijet simulation, from which a systematic uncer- 
tainty on Riight is inferred. 

Gluon fraction: This affects both the misidentification probability in simulation and 
the overall negative tag rates. The average fraction of gluon jets depends on the 
details of the parton density and hadronization functions used in the simulation. An 
uncertainty of ±20% is extracted from the comparison of simulation with data |47). 

Long-lived Kg and A decays: The amount of reconstructed Kg and A are found to 
be larger in the data than in the simulation |48| . To estimate the uncertainty on Rh g ht 
due to the Kg and A contribution, the simulated jets are reweighted by factors of 
1.3 ± 0.3 and 1.5 ± 0.5, respectively, in order to match the observed yield of Kg and 
A in the data. The quoted uncertainties on the factors account for the pj depen- 
dence. The yield is varied accordingly and the inferred variation on Rn g ht is taken as 
a systematic uncertainty. 

Photon conversion and nuclear interactions: The rate of secondary interactions in 
the pixel detector layers has been measured with ±5% precision [29ll49j . The corre- 
sponding variation implies a systematic uncertainty on Riight- 

Mismeasured tracks: According to the simulation, jets with a reconstructed track 
not associated with a genuine charged particle also present an excess of positive over 
negative tags. To correct for residual mismeasurement effects, a ±50% variation on 
this contribution is taken into account in the systematic uncertainty on Ru g ht- 

Sign flip: Small differences in the angle between a track and the jet axis can lead 
to a change of the sign of the impact parameter ("sign flip") and modify the nega- 
tive tag rate. In order to quantify this effect the ratio of the number of negative to 
positive tagged jets is computed in a muon jet sample similar to the one described 
in Section |5j with a larger than 80% b purity. Data and simulation are found to be 
in good agreement. From the statistical uncertainty on the comparison, the absolute 
uncertainty on this ratio is estimated as 2%, 1%, and 0.5% for loose, medium, and 
tight operating points, respectively. This sign flip uncertainty can be translated into 
a systematic uncertainty on Riight- 

Pileup: The misidentification probability depends on the pileup model used in the 
simulation. The simulated events are reweighted in order to match the pileup rate 
in the data. Differences between Riight values obtained for different running periods 
are used to estimate the systematic uncertainty, which is about ±1% for all taggers. 

Event sample: Physics analyses use jets from different event topologies. For a given 
jet pj, the misidentification probability is different for the leading jet or if there are 
other jets with higher pj values in the same event. Measured misidentification scale 
factors for leading and subleading jets have a dispersion of about 7%. In addition, 
misidentification scale factors vary by 2-7%, depending on the tagger, for different 
running periods. These two uncertainties are added in quadrature to account for 
an uncertainty due to sample dependence. This is the dominant contribution to the 
overall systematic uncertainty on the misidentification probability. 



The systematic uncertainties are detailed in Table 11 for the various algorithms and for the 



example of the medium operating points in the jet pj range between 80 and 120 GeV/c. 



The measured misidentification probabilities and data/MC scale factors are presented in Figs. 21 



and 22 as a function of the jet pj for the JPL and CSVM taggers. For a jet pi of about 80 GeV/c the 



misidentification probabilities are close to 10% and 1% for the loose (JPL) and medium (CSVM) 



40 



8 Misidentification probability measurement 



Table 11: Relative systematic uncertainties on SFught for jet pj in the range 80-120 GeV/c. The 
columns correspond to the different sources of systematics in the order described in the text. 



b tagger 


b and 


gluon 


V u and 


mismeas. 


sign flip 


MC stat 


pileup and 


total 




c jets 




2 nd int. 








evt. sample 




JPM 


8.6% 


0.8% 


7.9% 


1.0% 


6.4% 


0.9% 


9.4% 


16.5% 


JBPM 


6.2% 


1.2% 


6.9% 


0.5% 


1.6% 


0.9% 


9.0% 


13.2% 


TCHEM 


4.5% 


0.8% 


6.2% 


1.2% 


5.1% 


0.7% 


8.0% 


12.4% 


TCHPM 


1.6% 


1.0% 


3.0% 


0.6% 


2.5% 


0.6% 


9.2% 


10.3% 


SSVHEM 


1.0% 


0.9% 


3.2% 


1.9% 


2.9% 


0.7% 


7.3% 


9.0% 


CSVM 


3.2% 


1.8% 


4.4% 


0.7% 


4.6% 


0.7% 


7.4% 


10.6% 



Table 12: Misidentification probabilities and the corresponding data/MC scale factors SFught for 
different algorithms and operating points for jet pj in the range 80-120 GeV/c. The statistical 
uncertainties are quoted for the misidentification probabilities, while both the statistical and 
the systematic uncertainties are given for the scale factors. 



b tagger 


misidentification probability 


S flight 


JPL 


0.1000 ± 0.0004 


0.99 ± 0.01 ± 0.10 


JBPL 


0.1019 ± 0.0004 


0.96 ± 0.01 ± 0.09 


TCHEL 


0.1989 ± 0.0005 


1.10 ±0.01 ±0.09 


CSVL 


0.1020 ± 0.0004 


1.10 ±0.01 ±0.09 


JPM 


0.0107 ± 0.0001 


1.03 ±0.01 ±0.17 


JBPM 


0.0110 ±0.0001 


0.95 ± 0.01 ± 0.13 


TCHEM 


0.0282 ± 0.0003 


1.21 ±0.01 ±0.15 


TCHPM 


0.0304 ± 0.0003 


1.24 ± 0.01 ± 0.13 


SSVHEM 


0.0208 ± 0.0002 


0.94 ± 0.01 ± 0.08 


CSVM 


0.0151 ± 0.0002 


1.11 ±0.01 ±0.12 


JPT 


0.00116 ±0.00005 


1.03 ±0.04 ±0.25 


JBPT 


0.00117 ±0.00004 


0.95 ±0.04 ±0.19 


TCHPT 


0.00284 ± 0.00009 


1.26 ±0.04 ±0.21 


SSVHPT 


0.00207 ± 0.00009 


1.02 ±0.04 ±0.17 


CSVT 


0.00120 ± 0.00005 


1.17 ±0.05 ±0.21 



selections, respectively. Both algorithms show an increase of the misidentification probability 
with jet pj that can be explained by the higher track densities in collimated jets. The simulation 
reproduces this dependence to a large extent. The observed scale factors are close to one with 
a decrease of ~10% toward the highest jet pj. The misidentification probabilities measured 
with data and the data/MC scale factors are given in Table 12 for jets with pj between 80 and 
120GeV/c. 
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Figure 21: For the JPL tagger: (a) misidentification probability in data (red squares) and simu- 
lation (blue dots); (b) scale factor for the misidentification probability. The last pj bin in each 
plot includes all jets with pj > 670 GeV/c. The solid curve is the result of a polynomial fit to the 
data points. The dashed curves represent the overall statistical and systematic uncertainties on 
the measurements. 
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Figure 22: Same as Fig. 21 but for the CSVM tagger. 
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9 Conclusions 

The CMS collaboration has developed a variety of algorithms that are used to identify jets that 
arise from the hadronization of bottom quarks. Early analyses relied on simple and robust 
techniques, based on the second or third highest impact parameter significance of the tracks 
associated to a jet, or the flight distance measured using a reconstructed secondary vertex. More 
recent analyses use algorithms with better performance that define a powerful discriminant 
from the combination of several variables. The use of these more advanced algorithms is made 
possible by the high degree of agreement achieved between data and simulation, and by the 
robustness of the algorithms against variations in the running conditions. 

The algorithms provide selections at several operating points. The efficiencies of the algorithms 
at these operating points have been measured with a number of methods using multijet and tf 
events. A differential measurement of the efficiency as a function of jet pj, from 30 to 670 GeV/c, 
has been carried out with the multijet sample. This information is used in analyses that require 
knowledge of the performance of b-jet tagging over a wide range of transverse momenta. The 
information is also helpful for analyses such as the measurement of the tt cross section in order 
to avoid the strong correlations that can occur if the efficiencies are inferred from the tf event 
sample itself. 

The tt sample provides inclusive results, which are suitable for measurements of top-quark 
properties and for the analysis of standard model processes with similar jet momentum spectra 
and multiplicities. The misidentification probability, that a light-parton jet is mistaken as a fa- 
quark jet, has been measured by applying inverted tagging algorithms to the multijet events. 

The most effective algorithm is the Combined Secondary Vertex tagger. Using a loose selection, 
CMS achieves b-jet tagging efficiencies of about 85%, for a light-parton misidentification prob- 
ability of 10%. This selection is suited for tt analyses. In analyses requiring higher b-jet purity, 
such as searches for supersymmetric particles, approximately 70% b-jet tagging efficiency is 
achieved, for a light-parton misidentification probability of only 1.5%. These values apply for 
jet transverse momenta typically observed in tt events. 

The measured b-jet tagging performance is quantified and implemented in CMS analyses by 
using scale factor corrections to the MC simulation. These scale factors have been used ex- 
tensively to enable studies over a wide range of event topologies that would otherwise not be 
possible due to limited statistics. The scale factors for the b-jet tagging efficiencies are measured 
with uncertainties of 2-4%, and 3-8%, in the jet p T range 30-320 GeV/c and 320-670 GeV/c re- 
spectively. The maximum deviation of these scale factors from unity is approximately 10%. 
The scale factors for light-parton jet misidentification probabilities are measured to a precision 
of 8-17% over the full pj range, and differ from unity by at most 25%. 

The b-jet identification techniques discussed in this paper have been used in more than 40 anal- 
yses published by CMS, including measurements of top-quark properties, the Higgs boson, and 
searches for signals of physics beyond the standard model. The reduction of the uncertainties 
on the b-jet tagging scale factors has enabled the CMS experiment to decrease the light-parton 
background to unprecedented levels while maintaining high b-jet tagging efficiency for a wide 
range of processes containing heavy-flavour jets. 
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Glossary 

bSample Method to measure the b-jet tagging efficiency in tt events from a b-enriched sample 

CSVL Combined Secondary Vertex algorithm at the loose operating point 

CSVM Combined Secondary Vertex algorithm at the medium operating point 

CSVT Combined Secondary Vertex algorithm at the tight operating point 

FTC Flavour Tag Consistency method for the measurement of the b-jet tagging efficiency in tt 
events 

FTM Flavour Tag Matching method for the measurement of the b-jet tagging efficiency in tt 
events 

IP3D (method) Method for the measurement of the b-jet tagging efficiency in multijet events 
based on the impact parameters of muons 

IP Impact parameter of a track 

JBPL Jet B Probability algorithm at the loose operating point 
JBPM Jet B Probability algorithm at the medium operating point 
JBPT Jet B Probability algorithm at the tight operating point 
JPL Jet Probability algorithm at the loose operating point 
JPM Jet Probability algorithm at the medium operating point 
JPT Jet Probability algorithm at the tight operating point 

LT (method) Lifetime Tagging method for the measurement of the b-jet tagging efficiency in 
multijet events 

PLR Profile Likelihood Ratio method for the measurement of the b-jet tagging efficiency in tf 
events 

PtRel (method) Method for the measurement of the b-jet tagging efficiency in multijet events 
based on the transverse momenta of muons w.r.t. the jet axis 

PV Primary Vertex (proton-proton interaction point) 

SIP Significance of the impact parameter of a track 

SSVHEM Simple Secondary Vertex High Efficiency algorithm at the medium operating point 
SSVHPT Simple Secondary Vertex High Efficiency algorithm at the tight operating point 
SV Secondary Vertex (decay vertex of a long-lived particle) 
TC Track Counting (TCHE and TCHP) algorithms 

TCHEL Track Counting High Efficiency algorithm at the loose operating point 
TCHEM Track Counting High Efficiency algorithm at the medium operating point 
TCHPM Track Counting High Purity algorithm at the medium operating point 
TCHPT Track Counting High Efficiency algorithm at the tight operating point 
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